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Abstract * . 
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primarily fron such sources as professional journals aSo^eports of 
government-funded projects. Abstracts in t^his issue relate to such 
topic areas as evaluation of elementary science curricula utilizing 
behavioral tasks, effectiveness of audio-tutorial "instruction, 
effectiveness of tutorial con/puter-assisted instruction, effect of 
verbalizers on achievement of non-verbalizeis in an enquiry 
/^classroom, • and a study of verbal behavior patterns 'in primary ^rade 
classrooms during science activities^ (EB) \ 
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NOTES 



from the Editor 



Beginning with Vol, 1, No. 4, Investigations in Science Education 
will reflect some changes suggested by the ISE Advisory Board. One 
of these involves an attempt to ''cluster" analyses of research reports 
around a central theme, topic, or investigative approach. This will 
permit a more in-depth examination of current research In science^ 
education focusing on the underlying theoretical framework. Another 
proposal includes the analyses of some research reports from diffeffing 
points of view (in-paradigm/out~of-paradigm) . Thus, a Piagetian study 
might be reviewed by an advocate of Piaget's views of cognitive develop- 
ment and by an individual holding contrasting views. These changes are 



not intended to supplant the review of individual reports of research, 
but rather to supplement theni, ' ^ * 

Again, we encourage your comments and suggestions for improving- 
Investigations in Science Education . 



Stanley L. Helgeson 
Editor 

-Patricia E*. Blosser 
Associate Editor 
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Alexander, Donald L. and Alan C. Donaldson, "Earth Science In West 
Virginia Schools." Journal of Geological Education , Vol. 20, 
No. 4:193-195, September, 1972. 

, Descriptors — Curriculum, *Earth Science, ^Educational Programs, 
^ Geology Research, Science Education, *Secondary School Science, 
Surveys i 

Expanded Abstract and Analysis Prepared Especially for I.S.E. by. 
Victor J, Mayer', The Ohio State University. 



Purpose 

The stated purpose of the study was to determine the current 
status of earth science as a curriculum offering in the secondary 
schools of the state of West Virginia. It was implied, but not stated 
explicitly, that the study was carried out in 1971. 

Rationale 
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The introduction to the article implies that the junior author 
conducted a similar study. in 1957, reported in 1963, in which he found 
that only one West Virginia school offered earth science. The present 
study was initiated to determine the progress in school adoptions of 
earth science curricula in, the 14 years following that first study. 

Research Design and Procedure y ' ^ 

"Tha study was'' conducted in two stages. The authors first obtained 
a status repoialt on earth science offerings from the Department of Edu- 
cation of the state of West Virginia. They provide no indication as * 
to the nature and source of data for this report nor 'how the informa- 
tion wa& compiled. One imist assume that it did provide the names of 
those teachers in the state who were teaching earth science. 

In the second stage mor6 detailed information was obtained 
through the use of a questionnaire sent to every teacher of earth 
scienfce in the state. The questionnaire was designed to. assess: 

a) pupil attitude toward earth science; 

b) teacher attitude toward earlh science; y» 

c) teacher preparation in earth science ; 

d) materials being used by the teacher in ther classroom; 

e) me^thods being used by the teacher in his earth science 
classes ; and 

^ f) ^ rationale for the continued growth of earth science in 
West Virginia as expressed by the earth science teachers. 

Th^qi^tionnaire was sent to 107 teachers apparently identified 
by the State Department of Education as teachers of earth science. 
Responses were received from either 40^ or 42 teachers (both figures 
are cited at different places in the article), representing 39.6 
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percent of the population. Apparently no follow-up was attempted 
to gain greater response, nor was there any attempt to determine 
whether the non-respondepts differed in any way frpm those teachfers 
who did respond. 

The analysis consisted of tabulation of the data and conversion 
to percentages. 



Findings 

The major findings as cited by the authors are as follows: 

iV The number of schools offering earth science in the state 
increa?sed from one school in 1957 to 73 schools when the 
most recent survey was conducted (assumed to be 1970-71). 

2. Earth science is*. most commonly taught in, grades 7-9, with 
the highest number of students enrolled in the course in 
grade 9. ' 

3. Only four of the 40 respondents were teaching earth science 
* full-time. The remaining teachers taught other sciences, 

English, music or driver education in addition to earth 
science. v 

4. The response to the items on teacher background revealed 
" that 65 percent were certified to teach earth science. 

Twenty percent were not sure whether they were certified 
to teach earth scienpe. The authors did not provide infor- 
mation as to the nature of the state certification require- 
ments. The number of years teaching earth science averaged 
3,8. The most frequently indicated scietice area taken by 
the teachers was physical science, averaging 7 credits per 
teacher. Eleven teachers indicated no background in it. 
, Geology was also taken by all but 11 of the teachers, with 
the average number of hours being 5. The hours of an area 
^ taken weje averaged over the entire number of teachers 

responding, rather than on the basis of the number of 
teachers indicating having background in that area. This, 
gives a rather misleading impression of the depth of back- 
ground in each area. Also, it is not indicated whether 
the hours are expressed in semester or quarter hours of 
credit. 



Of thos'ei^teachers returning the questionnaire, 65 percent 
indicated that they woulO, like to take additional background 
in earth science; 60 percent wanted more geological field 
training, an area which the authors state is neglected in 
West Virginia. The teachers would prefer summer institutes 
to other forms of additional training (50 pei^cent responding 
favorably to that^J^tem on the questionnaire). 

Teachers were ^so asked to list the materials that they used 
in teaching earth science. The most frequently used textbook 
was Modem Earth Science, by Ramsey and Burkley (46.3 percent). 



Thfe Earth Science Curriculum Project (ESCP) textbook. 
Investigating the Earth was used by only three teachers 
(7.3 percent). The most frequently used equipment was 
film arid f ilmstrips (60, percent) ; topographic maps were 
used by 35 percent of the teachers; only 2 (5 percent) used 
the-ESCP laboratory equipment. Fifty-five percent of the 
teachers did not use laboratory sessions as an integral 
part of their earth science courfie. An explanation is sug- 
gested, according to the authors^ by the fact that 35 per-. 
. cent of the teachers felt that their laboratory facilities 
were %elow average" while 62.5 percent responded that their 
laboratory facilities were average or above average. Only 
37.5 percent of the teachers indicated that field trips 
wece integral parts of their courses. They* conducted an 
average of two per year . , 

6. Of the teachers responding to the questionnaire, 97.5 percent 
felt that earth science offerings in the state should be 
expanded. They felt that a knowledge of earth science was 
important in: "*(1) environmental understanding; (2) satis- 
faction of children's curiosity of local fi^ld area under- 
standing; (3) intellectual value; (4) training in using 
natural resources wisely; (5) understanding the effects of 
pollution; and (6) acquainting children with our home. Earth." 
The authors did not indicate if these reasons were the result 
of a free response option on the questionnaire or of rankings 
provided from forced choice items. There is also no indica- 
tion h(5w frequently each of these reasons was offered. 



Interpretations 

The authors conclude that there need to 1^ some drastic changes 
in the way ea^th^science is being taught in West Virginia, in light 
of the limited dfegree to which laboratory and field experiences are 
being offered by the teachers responding to the survey. They suggest 
that a possible answer to this problem would be to offer more field 
training for teachers. They^also conclude that earth science must be 
made available to more students in the state, since "only 4.9 percent 
of the public school students are currently exposed to an earth science 
course." In addition, improvement is needed in the physical facil- 
ities for earth science laboratories • and in .teacher preparation. 



Abstractor's Notes 

The study reported in this article should be a valuable source of 
information for educational planners in the state of West Virginia. 
It is also of interest to those in earth science education around the 
country in that it -documents the growth of earth science curriculum 
offerings in a state where problems associated with the utilization of 
earth resources are in the forefront of national attention. The in- 
formation would appear to be valid and fairly well documented. The 
report of the study, however, is replete with problems in interpireta- 
tion, manner of statement citation of data" and vagueness of relation- 
ships between conclusions and data. In general there is a failure to 
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supply enough information to allow the reader to evaluate the don- 
clusions which are drawn by 'the authors. ' 

The authors, in the introduction to th^ article and in a table 
on the same page, citi statistics on earth science enrollments in 
1957 and again in 1962-63. They do not stlpply any information 'on 

^how these statistics were gathered in the study being discussed. In 
the next section (entitled "Current, Status*')-*^ it is stated that infor- 
mat,ion on curpnt status was obtained from 'the West Virginia State* 
Department of Education, and the reader is left to assume that the 
information in the previous two studies came from the same source. 
If this is true, then, hopefully, comparisons can be made, p'Vovided 

^the state department has nob changed its methpds of acqtiir^Lng ,anii 
reporting school information in the interim.* The ^'authors also fail 
to state the procedure used by the State. Department of Education 'in 
|pquiring its information. If it is based 'on annual reports from 
principals, as it dsin Ohio, it could be highly reliable, but the 
reader' does 'not know. In the same sec^ion"^ statemenr is made that ^ 
of 55 counties in West Virginia, 30 offer earth science. This 'would 
imply that West Virginia, unlike the rest of the' country, operates on 
a county-wide school system. What the authors probably meant to say 

.is that earth science is offered by schools located in 30 different 
counties. ^ . 

The* only major fault with the design is ari apparent failure to 
pilot the questionnaire and to dejtermine its reliability or validity. 
This is an unfortunate characteristic of many questionnaire studies. 
Further compounding the problem in this particular article is the 
failure to* state the questions us^ \n the qjiestionnaire . This could 
very readily have been done early in- each section tf the article 
relating to a particular question. . This would give the reader the 
opportunity of judging the validity of each* question and therefpre the 
quality of the information the question generated.* Apparently the 
authors failed to send ojit a ffollow-up questionnMre . Doing so could 
have increased their return substantially. In any case, they failed 
to determine if there were any substantial differences between the 
respondents and the non-respondents . This could readily have been 
done by randomly selecting ten or so n(5h-resppndents , obtaining their 
response to the questions over the telephone, and then comparing their 
esponses with the othera. If there were no great differences, than 
he conclusions of the study could be generalized for the entire 
population of West Virginia earth science teachers. As it is, the 
authors consistently conclude that the re^sults of this survey repre- 
sent the status of earth science in the state as a whole. ' While this 
may be true, the design does not permit such generalizations. 

There are numerous stateme4|-ts that bother the careful reade^^of 
the article. At the end of the section entitled VSurvey Questionnaire 
the statement is made that "The total population of 107 earth science 
teachers in West Virginia was sampled." What is me an t^ is that the 
questionnaire was sent to the entire population of earth science 
teachers in the sta^. In discussing tfhe preparation of earth science 
teachers as indicated by the survey, the statement is made that "The 
preparation of teachers in earth science is still far behind ,the 
preparation of teachers in other areas of science." Ona must assume 
that what is meant is that earth SiCience teachers have less background 



in earth science courses than other science teachers have: in their 
particuJjar, disciplines. Even if one makes this assumptioia, hofwever, 
it is still not clear if the statement means other science teachers 
in West Virginia as the referent group, or other science teachers 
nationwide. What is even mojce serious, the authors fail to cite the 
source of their knowledge on the preparation of other science teachers. 
A similar problem occurs in the section entitled '^Methods and Materials 
used in Teaching Earth Science." In discussing the failure of earth 
science teachers to include laboratory work in their courses, they 
state that "Marked decreases in learning and in the desire to learn 
take place when active student in"Vfolvement is not utilized in science 
courses." No studies are cited^which might lend support to such a 
statement. In fact, there appears to be little empirical support for 
this statement. Subsequently the authors attempt to explain the 
failure to use laboratory activities partly upon the teachers' per- 
ceptions of their laboratory facilities » This is based upon a question 
in which 35 percent of the respondents rated their facilities below 
average » and the remainder rated them average or above. One of the 
"advantages" of earth ^ience as a laboratory course is that it requires 
very minimal laboratory facilities ^-^gjlt is here, perhaps, where the 
bias of the authors begins to show through, especially when concluding 
that only 37.5 percent of the teachers were conducting an average of 
two field trips per year as an integral part of their course. 

One final error that is quite disturbing is f6und in the authors' 
conclusions where they state that only 4.9 percent of the public school 
students are currently exposed to an earth science course. This 
figure is apparently based upon the statement made at the beginning of 
the article that' 4.6 percent of the students in grades 7-12 .were taking 
earth science in the year that the report of the State Department of 
Education was compiled. The authors apparently made an error in quot- 
ing* the percentage. If. we assume that a student would take earth 
science only once in secondarj^-^schopl, ignoring repetitions due to 
failure and the availability of secon3^T:evel earth science courses, 
and that the relative proportions of students in the various- grade 
levels "remain constant between school districts throughout the state, 
then the maximum probability that a student would take earth science 
is: 6 (the six years that a student would be enroiLled in secondary 
school grade levels) times 0.046 (the probability that a student takes 
earth science in a given year) or 0.276. In other words, on,e would 
expect a maximum of ^7.6 percent of the graduates o^ high schools in 
the state of West \^irginia to have^ taken an earth science course while 
in the secondary schools. 
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Atwood, Ronald K., Mark R. Brox^ and Anna A. Neal, '^Evaluation of a 

Hybrid Elementary Science Curriculum Utilizing Behavioral 3*asks." 
School Science and Mathemat4.cs , Vol. 72, No. 7:641-646, October, 
1972. 

DescriptorsT-*Behavioral Objectives, *Curriculum Evaluatibn, 
. *Discovery Processes, Educational Research, Elementary Grades, 
*Elementary School Science, Evaluation, Instruction, Science 
Education 

A/ 

Expanded Abstract and Analysis Prepared Especially for l.S.E. by 
Glen 0. Blough, University ot Maryland. 



Purpose ■ 

•5) ■ ' 

To describe an initial effort to develop- a scheme that utilizes 

behavioral tasks for the evaluation of an experimental elementary 

science eurric;plum in a soT|ool with" modest resources. Hypotheses 
tested for group post test means were: 

1. The experimental primary group will ^score significantly 
higher (.05 level) than the control primary group 

2. The experimental intermediate group wMl s^oore signifi- 
cantly higher' (.05 level) than the control intermediate 
group. ^ . . 

3. The total, experimental group i^ll score fii^if icantl,y 
higher (.05 ^qvel) th\^ the to^al control groti^. 



Rationale I 

The twenty behavipral tasks for the^ primary level (readiness 
through grade 3) were selected from the SAPA Competency Measures 
with few and miiior , modifications . The^procesqes incliuded and the 
number of tasks used to evaluate the processes were: observing (3), 
classifying (3) ^ using numbers^ (2), space-^time relations (3), com- 
municating (4) , measuring (4) , and predicting (1) . 

For the intermediate level- (grades 4 through 6) fifteen tasks 
were selected from the SAPA Competency Measures with few and minor 
modifications, and five similar tasks were prepared, by the investi- 
gators. The processes included and the number pf talks' used to 
evaluate the processes were: observing (=•2), classifj^i-ng (2), using 
numbers (2), space-time relations (1), communicating (3), measuring 
(5), predicting (1), inferring (1), control df variables (1), and 
design of experiments (2)^, The last ten taskip used with the* primary 
group were identica the first ten tasks ijsed with the IntermedJ.- 
ate group. The philosophy and general objectives |of the school ' 
system provided the basis for task selection. 
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Research Bealgn and Procedure . 

The Fayette Coiihty Schools » Lexington, Kentucky, decided in the 
Fall *l)f 1969 to pilot Science, A Process Approach (SAPA) and Elementary 
Science Study (ESS) . .Because a '*whole-school" approach was preferred 
and funds were rather limited, SAPA was implemented at the primary 
level, and ESS at the intermediate level In a single small urban ele- 
mentary school. A school comparable in terms of- student socio-economic 
levels, enrollment, racial balance, curricula (otheif than science) , 
and teacher preparation was available to serve as a control. The 
science curriculum in the control school rejied heavily on a con tent - 
' centered text. " ^ 

- -a 

Since the philosophy ftot science instruction officially adopted 
. by the Board of Education clearly favored process-oriented education, 
as' opposed to information storage process-oriented tasks were chosen 
as the most meaningful mode of tpfeasuring student performance. 

Recognizing task administration on an ^dividual basis to be very 
time consuming, a random sample df approximately 20 percent of the 
school enrollment was chosen l^y grade level in both e:t]perimental and v 
control schools. One investigator administered *the tasks, both pre 
and post, to all primary. leveS subjects. Another investigator admin-. 
istered the tasks,* both pre and post, to all intermediate level sjib-ir 
jects. Performadce on each task was scorjed as either' acceptable or 
, non-acceptable ^^ased on specific criteria, and each student's score 
represented his ^ymber ot acceptable resteonses. Pretesting was done 
during early October, 3,969, and the post^=a<^nistration was completed 
in early liay of 1970. 

Tlie teaching staff of the experimentajj school deceived six days 
'of special igtjreparation t^rior to the opening of school, and tl^r^e days . 
during the fchool year. Teacher Corps interns and special education- 
faculty pa^rticipated . The control school faculty, which also included 
Teacher Corp^ interns and special education facility , received no pre- 
paratory or^Un-servlce experiences in elementary school science. 

- t ■ cr 

p Findings 

Two tables supply the data results. Table I is a data siiramary. 
The treatment means dndicate the arithmetic averages of the raw scores, 
the adjusted means were the result of post test scores with the pre 
test scores ^used as ^ covariate. In Table II. the adjusted means were 
. used in miaiysis of covariance to test the hypotheses. 
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TABLE I. DATA SUMMARY 





Grotip 


Level , 






Treatment 
"Mean 


Adjusted 
V Mean 




o 


PfdUtaary . v 




Pre 
Post 


5.48 
9.14 


8.97 


21 
21 


E2tp!e3?lmental 

\ \ 


. Intermediate 




Pre 

Post 


3.61 • 
6.39 


6.85 


23 
23 




^ Primary & 
Intermediate 




Pre 
Post 


4.50 
7.70 


7.85 


44 
44 


i 


Primary 




Pre 
Post 


* 5.00 
5.20 


5.38 


20 
20 


ft 

Control 


Interml^diate 




Pre 

Post 


4.72 
5.72 


5.30- 


, 25 
^25 




' Primary & 
Intermediate 




Pre 
,Post 


1 • — 

5.84 

5.49 


. 5.35 


45' 

45 




TABLE II, ANALYSIS 


OF COVARIANCE SUMMARY 




Groups 


Source 


df 


Mean-Square 


F 


P 


Primaty 


Treatment 
, Ertor 


1 
38 


130.68 
2.94^ 


44.50 

* 


.01 


Intermediate 


Treatment 
Error 


1 
45 


/ 27.00 . 
4.51 


5.99 


.05 


Primary & 
In terme delate 


Treatment 
Error 


1 
86 


137.97 
3.94 


35.04 


.01 




On the basis of these data the escperimenterG concluded that the 
eKperi^ental, groups out-performed the control groups on the behavioral 
tasks and all the hypotheses xuere accepted. They caution about inter- 
preting the difference of the levels of significance for the primary * 
and itttermediate groups since only ten of the twenty taslco were common 
to both levels and coaparisons made involved only eaiperlmental vs. 
control groups. 

The escperimente^s point out that the difficulty level of the tasks, 
of initial concern to the investigators, proved to be acceptable. All 
students scored above zero on pre and, post administrations, with a ^ 
total of five students scoring one. A total of five scores above 12 
'were recorded "with a single high score of 16 most closely approaching 
the maximum score of 20. Thus, "topping out" was not a problem in 
this, study and would, likely not becotae a problem were the same tasks 
utilized in a longitudinal study ejitending another year. 

EkiosDlnation of results foif each individual task provided useful ^ 
information. For esiample, on oiie task primary level subjects were 
asked to make a group or set of 0^.1 the large ilue t.:pianglos after 
they had been presented with d set of the ESS A" Blocka[V ^ On the pre- . 
test eight sub j acts in each group demonstrated competency while on 
the post- test 1? subjects in the e3q>erimental group and only seven 
subjects in the! contipol group demonstrated competency. 

They further point out that from some taslcs plus observations 
made while the tasks were being administered, it was* inferred that 
the language used to present the tasks masked the extent to which some 
process skills were develoi^d and, further, that this masking effect 
was greater fOr the control group, 

) 

Findings and Interpretations 

The escperimenters indicated that the assumption that, behavioral 
tasks provide a sound mechanism for assessing curricular outcomes 
judged to b^ especially important at the local level is supported by 
this study. Cbmparisons of experimental and control gtoups produced 
statiatically significant results in favor of the e:cperimental groups. 
Subjective eiiamination of results by task and clusters of tasks were ^ 
^considered fruitful. To strengt^ien these subjective analyses one ^ 
cOuld use chi square where cell frequencies are adequate. One of the 
real bonuses of. utilizing behavioral taslcs lies in l^ie insights 
individuals administering the tasljs gain in how child3^^n perceive and 
attack task solutions. ^ \ 

One of the greatest sources of difficulty is the language of 
the task, which can mask skill development. A more obvious difficulty 
rests with attempts to consistently administer the tasks. 

It should ^Iso be remembered that failure to demonstrate competency 
on a particular task provides limit^ information, since subordinate 
(enroute) behaviors may have been developed . ' Wot knowing the extent 

which this may have occurred is an unavoidable problem when any/ 
sample of tasks is utilized, but the problem can be expected to become 
larger as the sample of. tasks becomes smaller. 
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Abstractor's Notes 



While auch research ao this addo to the significant Imowledge of 
EOthodQ and procedures in science teaching and learning, a jcoigpiling 
and comparison of the results of similar studies seeiDS highly desir-. 
able before truly dependable results can accrue 4 

» In learning studies* as we all lcnox^> there, are many variables 
that are difficult to control effectively. For ejiample, children in 
the experimental group may indeed respond positively to the situation 
in whic')a special teachers are involved and because of this achieve 
to their greatest capacity while the routine in the control group 
does not elicit such effort. It is also common knowledge that with- 
out extra training the teachers in the control group cannot possibly 
iSeasjure^up to the standard , of the investigators. , In many cases the 
teachers of the control groups are not only untrdined in the methods 
fcf instruction, they may.ina^ed be unaware of the objectives for 
teaching the material while the experimental group instructors are 
teaching with specific objectives in mind. No amount of statistical 
manipulation can really cofmpensate for these differences. 

What may be thought of as a byproduct of such investigation is , 
the growth and development made by the investigators in understanding 
the many facets of how children learn, th§ir difficulties, potentflal, 
the importa^ice of motivat'ion; indeed, an understanding of the problems 
involved* in teaching at the elementary school level. The escperience. 
may ^Iso result in a more thorough understanding of how science fits 
into the total element ai?y school program. It may further result in . 
examinilig some of th^ behavioral tasks and processes in light of their 
appropriateness and "significance of young children. As is the case 
in many of our investigations into the learning of children, careful 
study of the teaching process as it^ goes on and the data that result 
cause us to proceed with c£^ution in forming generalizations from our 
experience and results. Many of the so-called studies in accounta- 
bility presently in process will do well to apply a more scientific 
attitude toward interpretation. 
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floffman, Frederic and Marvin Druger. "Relative Effectiveness of Two 
Methods of Audio-Tutorial Instruction in. Biology." Journal of 
Research in "Science Teaching , Vol. 8, No. 2; 149-156, 1971. 
Descriptors— *Audiovisual Instruction, *Autoinstructional Aids,' 
College science. Educational Strategies, *Genetics, ^Instruction, 
Research . ^ ' . 

Eacponded Abstract and Analysis Prepared Especially for I.S.E. by 
,Martin Hetherington, Michigan State University. 



Purpose a 

The purpose of this research was to analyze and compare the 
relative effectiveness of two different audiotape strategies with 
respect to their effect on student achievement, student retention, 
attitudes, problem solving abilities and critical thinking abilities. 
The^e were seven hy|)Otheses tested. 

R ationale 

Since th'0 inception of audio-tutorial instruction, no systematic 
attempt has b^en made to analyze and compare the relative effective- 
ness of different audiotape strategies. In the past many of the audio- 
tutorial programs h^ve been structured intuitively or through trial 
and error by the -^nstructor. 

Research Design and Procedure ^ 

A sample of 90 students, 43 males .^and 47 females, was randomly 
selected from more than 800 students enrolling for the first semesiier 
pf a general biology course at Syracuse University. Only five of the 
students had prior experience with audiotaped instruction. The sample 
was randomly divided into two equal sized groups, each being taught 
by a different audiotape strategy in separate audiotape laboratories. 
Each laboratory was equipped in the same x^ay.^ Students were provided 
with guidesheets for the current lesson when they entered the labora- 
tory. These guidesheets contained charts, diagrams, problems and 
instructions for laboratory work. 



At the beginning of each week the students signed up for as many 
two hour blocks l>f laboratory timfe ab they wished for that week s work. 
There were no discussion periods or lectures. ■%eneral information and 
quizzes were administered once a week. - • 

The eacperimental period lasted six weeks. The subject matter wste 
the same for both groups: mitosis, meiosis, and genetics. The major 
difference between the two experimental groups x^as the audiotape teach- 
ing strategy. 
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The direct group, received lessons of a descriptive or expository 
nature that uQualyLy involved^pa'^sive notetaking by the students^ The 
emphasis was on listening, with little or no question and answer activ- 
ity. Students were directed to examine diagrixms, f ilm loops, allde^ 
and guidesheets. There were no self-qulzze^ and very little ptohlem 
solving or investigations done by the students in this group. The 
laboratory work was guided by* audiotapes and written instructions, with 
all observations, conclusions and answers to questions stated on guide- 
sheets. The amount of student-teacher interaction was minimal in this 
method. 

The indirect group was taught by a lecture-question-answer method 
in which the students ^proceeded toward the objectives in an inyesti- 
gatory manner. Lecturing was on the essential^ information and no more. 
The film loops, slides, and demonstration materials were prl^rily the 
same as those used for the direct group. The gtiidesheets were dif- 
ferent in that they contained only a skeletal framework as a gui<ie to 
learning. The students used the guidesheets to develop answers to 
questions, form their own conclusions and solye problem^. Laboratory 
assignments were ouained on the guidesheets and the students answered 
questions and made observations to accomplish the laboratory objectives. 
The amount of student-teaeher interaction within the taped lessbns was 
much greater in this group than in the direct group ^ " . . 

An analysis instrument had to be developed in-order to determine 
whether the. two* strategies used were different. This instrument xra^ 
similar to . that used, by Flanders f-or classroonl Ihteractitsn, Nine ^ 
categories o^ classif ica£ion were used^ four categories were directed 
at the Indirect group and five categories, at the N?lrect group. The 
first four categories of classification, 1 through 4, were designed 
for the Indirect group and came under the heading of 'Wtivating 
Statements." The last Jpf categories were designed for the Direct 
group. ^^Categories 5 aifflO were under the heading of "Information 
V Giving," and 7, 8, and 9 under the heading of "Control S^tatements . " 
The taped lessons and the laboratory easetcises were analyzed using.this 
instrument. ■ ^-^ m 

The analysis instrument w^s designed to determine tl^ directness 
pt indirectness of a lesson baseS on teacher input as obtained ftfom 
tape manuscripts and guidesheets for each group. The categories were 
grouped into direct and indirect activities, according to^^the freelom 
of action of the stu4ent 4uring the taped sequence . Direct activities 
were either of dn informa<£ion giving nature or a for^ of control. 
Indirect activities w6re those in \ah±ch the student's freedom of actiOh 
wa^ considerably increased and in which he may have been mentally 
motivated br encouraged. Each taped script and laboratory guidfesheet 
was evaluated according to the nine categories.,, The* analysis was , 
reached in sequence as the categories occurred in the scripts. Cate- 
gory numbers Were' then compiled in a matrix, one pair at a time. 

To determine the reliability of the analysis tool, a Scott's . 
correlation was computed of each of 10 taped lessons using three sfe£f- 
train^d observers, ^ 

The Watson-Glaser Critical Thinking Exam, form Ym, was us^d to 
measure critical thinking' changes and transfer of learning abilities. 
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" \ A Subject matter e^tam, prepared by the inves^gator, was used to 
test differences between the two eacperimental groups in achievement , 
retentfon, an|J problem solving abilities. The reliability of the 60- 
item examHjas determined to be 0.86 using the Kuder, Richardson formula 
20. The content validity was maintained by including a cross-sectional 
sample of a large number of items representing the areas in which per- 
formance was to be evaluated. The test items were developed ty instruc- 
tors in the field of genetics. An Item analysis based on 175 students 
taking each of the 60 multiple choice items was made. 

A further breakdown of the 60 items was made to derive the two 
types of questions used in this ii^estigation. The knowledge type 
questions were intended to measure'' the student's ability to recall 
facts and make simple observation^. The use of knowledge type questions 
was designed to test the student's ability to apply his knowledge and 
solve problems, state and make judgements. The 60 questions were 
reviewed by a panel of five educators to determine the appropriate 
type- ... 

An attitude questionnaire was prepared by the investigator to 
* evaluate attitude changes in the two groups . 

Each test instriunent was given to both groups. Both groups were 
pretested and posttested. 



Findings 

The investigators found that they could train observers to use 
their analysis instrument with a" high level of agreement in their 
script and .laboratory analyses. , k 1 

They were successful in constructing a 60-item iijultiple choice 
test which had the following discriminating powers : 55 percent ^^e 
0.4 and over, 37 percent were 0.2 to 0.4 and 8 percent were 0.2 ^d 
under . 

The attitude questionnaire, which was prepared by the investiga- 
tion, indicated that, the pretest scores would reflect the two groups to 
be homogeneous at the start of the course. All the "t" values which 
were computed to determine differences bet^ween the two groups fell well 
Ijelow the required t (.05, 85) of 1.99. ^ ' 

The seven null hypotheses tested were : 

Null Hypothesis 1: At the conclusion of the experimental period, 
the two groups will not 4i£jEere with respect to their mastery of facts, 
concepts, and principles concerning heredity as measured by a subject 
matter exam prepared by the investigator. ^ 

A t-test was performed to test differences between the means of ' 
the pretest scores of the two groups. That was determined to be 0.949 
and the required t (.05, 85) was 1.99. The results of the t-test 
indicated that there Was no significant. difference between the direct 
and indirect groups with respect to their mastery of facts, concepts 
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and principles concerning heredity. Therefore, the hypothesis was notM^ 
rejected. ^ 

t 

^ Null Hypothesis 2: ,At the conclusion of .the experimental period, 
the two groups will not differ with respect tfo their problem splving 
ability aa measured by the use of knowledge questions from the investi- 
gator's subject matter exam, ^ ^ 

A t-test was performed to test differences between the means of 
the two groups with respect to the use of knowledge questions. 

The t obtained was 3.03 and the required t (.65, 85) was 1.99. 
The results of th6 t"-\e&t<Lndicate ' that there wfeis a significant dif- 
ference between the two groups. The Indirect group performed signifi- 
cantly better, with respect to problem solving abilities. Therefore, ' 
the null hypothe is was rejected. 

Null Hypothesis 3: The two experimental groups will not differ 
with respect to their ability to retain factual and conceptual infor- 
mation was measured by a subject matter test prepared ]i>y the investigator 

The observed t value for the t-test for difference between the 
means of the twd groups was 1.07' and the required t (.05, 85) was 1.99. 
Based on the results of the t-test, the hypothesis was not rejected 
since there appeared to be no signific^t difference between the two 
groups with respect to their abilities to retain factual and concept- 
ual information . 

» 

Null Hypottiesis 4: At the conclusion of the experimental period, 
the two groups will not differ with respect to their ability to think ^ 
critically and to transfer learning to other situations as measured 
by the Watson-Glaser Critical Thinking Exam, form Tm. ^ 

The t-test results for differences between the means of the two 
groups produced a t of 0.18. The required t (0.5, 85) for a signifi- 
cant difference was 1.99. Based on the test, there appeared to be 
no significant difference between the critical thinking abilities of 
the two groups, and therefore , ^hej hypo£hesis was not rejected. 

Null Hypotheses '5, 6, and 7: Three' null hypotheses we^e tested 
with regard\ to the attitude questionnaire. 

At the conclusion of the experimental period neither group will 
show a difference from their original attitude^ toward : 

(a) biolo^ and genetics as course subjects 

(b) the indirect teaching strategy 

V (c) the audio-tutorial method of instruction 

Each item on the questionnaire was assigned to one of the three ^ 
categories and each was examined tdi determine the direction of the 
shift in the response patterns of the two groups from the pretest to 
the posttest. 
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Each of the hypotheses was exanrined in light of the following 
questions : 

(1) Were there any individual items which showed significant 
changes between the pretest and the posttest for either 
group? . . . 

(2) If there were any significant changes* What wer^ the 
directions of change; increase or decr^^k (positive or 
negative)? » # 

(3) Was there a significant di'fference between the number of 
r items showing a positive shift 'and the number of items 

showing a negative shift in attitude for either group? 

Within the category of genetics and biology as course subjects, 
the pwo* significant re&ponses indicate that students of both groups 
preferred 'the genetics units at the beginning of the course. The one 
significant negative response indicated that the direct group felt 
that biology was a difficult course • 

The*indirect group felt that theVe was sufficient information on 
the guidesheets while the direct group felt that there should have 
been more. Ueither group believed that the students should be made 
to search for knowledge, but they liked the idea of weekly quizzes. 

^ • , ■ ^ 

In general,, the students felt that the merits of audio-tutorial 
instruction far outweighed any shortcomings and that it was an effec- 
' tive and inspiriiig method of teaching. They felt that they learned 
more in a shorter time, were encouraged to think critically, were not 
frustrated by the method and were helped by the guidesheet and the 
clearly defined obji^ctives. ^ ' ' 

Not all of t'hd items on the questionnaire reflected a signifi- 
cant shift from the students' original attitudes. 

Out pf ' the 10 items related to biology and genetics as coutse 
.'subjects, both groups reflected 3 positive and 5 negative shifts. 
From the six items pertaining to the indirect strategy, both groups 
showed three positive anS three negative shifts. Neither group 
reflected a significant change from their original attitudes. There- 
fore, the hypotheses concemlng^j^ihese two aspects were not rejected. 

Out of the 28 items related to audio-tutorial instruction, the 
direct group showed 28 positive and 0 negative shifts and the indirec^ 
group showed 26 positive and 2 negative shifts. Therefore, this ^ 
hypothesis was rejected since there was a difference from the students 
original attitudes toward the audio-tutorial method. 

Interpretations ' ,t . 

The results of the study indicated that both the direct and the 
indirect audio-tutorial strategies were equally as effective for teach- 
ing facts, concepts, and principles concerning heredity. Both 
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strategies had the same effect cqncernin^^the retention of facts, con- 
cepts, and principles concerning heredity. Both strategies resulted 
in a significant change in students' attitudes in favor of audio- 
tutorial instruction, y 

Neither the direct nor the indirect strategy improved students' 
critical thinking abilities. Neither caused a 'significant change in 
the students' attitude tox^ard biology and genetics as course subjects 
or toward the indirect teaching strategy^. « 

There wa.s a significant difference between the two strategies 
with regard to problem solving abilities. The results of the study 
suggested that the indirect teaching 'strategy might be a more effective 
teaching strategy th^n the direct for developing problem solving 
, abilities. re 

The tape analysis instrument developed during this study provided 
a means to objectively analyze the directness or indirectness of taped 
instruction. The res^ilts do not indicate highly significant differences 
between the direct an^ indirect, teaching strategies. 

Abstractor's Notes 

Methods to qu^tify the value of audio-tutorial teaching have 
been needed for a long time. .This paper tries to get at one technique 
to do this. The fact that; the main finding of this research was the 
interest of students in the audio-tutorial teaching is not new. The 
survey points out that of the 90 students used in the research, only 
5' had prior experience with audiotaped instruct;4.ons and that this 
instruction was in language laboratories. This mean^ that for most 
of the studenti^ the Hawthorne effect was a possibility. Similar 
rei^earch should be done with a student population who had been exposed 
to several audio-tutorial courses before the direct-indirect teaching 
strategies were applied. ^ 

Why wasn''t the attitud^ questionnaire also given to Include 
attitudes about the direct teaching strategy? 

The, results might have been expected' to come out the way they did; 
this does not necessarily mean that this is the best way to teach 
problem solving to students. 
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Hughes, Earl F. , "iole Playing as a Technique for Developing a 

Scientific Attitude in Elementary leather Trainees." Journal 
of Research in Science Teaching . Vol. 8, No. 2:113-122, 1971. 
Descriptors — Elementary School Teac'hers, ^Instruction, Methods 
Courses, Research, *Role Playing, Science Education, *Scientific 
Attitudes , *Teacher Education 

Expanded Abstract and Analysis Prepared Especially for I.S.E. by 
William R. Brown, Old Dominion University., ' ^ * 

Purpose I ^' 

The problem of Jdevisdng an effective teaching technique for . 
developing a more positive attitude toward sc'ierice, scientific 
endeavors, and scientists as "Applied to preservice elementary tes^chers 
is the general f ecus' of this study. The null hypotheses tested were: 

(1) There is no significant difference 051 the science attitude 
measure between the control group and the combined treat- 
ment-groups of reading a persuasive communication, role 
playing, and^ observing the role playing. 

(2) 'There is no significant difference between the 'group that 

only reads the communication and the comb^ined observer 
and role playing groups on the science attitude measure. 

(3) There is no significant difference between the observer 
group and the role playing group on thie science attitude 

, measure, . 

R ationa le / 

— ■ ■ J ^ 

The author contends that elementary teadhers , should have a 
positive scientific attitude in order to more effectively teach 
elementary children an active, hands-on version of science. Since 
many preservice teachers have experienced science only as an organized 
body of knowledge, they have not been given the opportunity to form- 
ally experience the process and concept development components of 
science. As a result of this limited experience, preservice teachers 
,in general do not exhibit a positive attitude toward science, sci- 
entists, and scientific endeavors. 

A review of research studies related to existing characteristics 
of science teacher^ and science* students provides a broad series of 
conclusions that directly or. indirectly relate to the stated^roblem 
and hypotheses. Generally , elementary teachers are reluctant to teach 
science, the amount of science training for teachers is positively 
correlated to opinions held by scientists, and college science classes 
do not contribute to ^ understanding of science. 

A fundamental assumption of this study is that college science 
courses appear to .increase the preservice teacher's confidence and 
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compet^sttce in science, but they do not cause the prospective teacher 
to jfcedch for current curricular objectives at the elementary school 
le>||ol. The investigator assumed that an attitude change technique 
us ^rig perstfksive commynication and role playing would effect attitude 
change in preservice teachers as part^of a science method^ course. 
Six studies are cited that deal with attitude change techniques-. 

( Research Design, and Proceduiffe * ■ ■ / 

^ ^ A'^ur groups we're formed. Grotip C, the control, Sfe'ad an innocuous 
commuziication and responded to ten science attitude Items (post- test) . 
Group R read the per'suasive coinjminication and responded to the science 
It^msw Group RO read the persuasive communication » listened to and 
observed the role playing, and responded to the science items, > Group 
RRP read the persuasive communication, role played a ^proponent 'ef the 
positii;^ of the communication, and responded to the science items. 
The p^eirsuasive treatment was administered four weeks after the pre- 
test was giy^n. The pretest scores were used to assign subjects to 
expeirimental groups with no significant difference ^between the mean 
science attitude of the four groups. Nine days aftei' the treatment 
a post-^poat-tfiat, identical to the pretest, was administered.. 

The sample of 184 was drawn from science methods or physical 
science for elementary 'l^eachers classes from* four colleges. The 
students of an instructor from each institution were used. There were 
28 males and 176 females in the 20A post-test subjects. Of* t^ese, 
97 were juniors and 96 were seniors. Nine subject^ had incomplete 
data on year in school. 
• - ' , * 

The persuasive communication took a position on each-sof the ten 
questions included in the attitude measure* The main points emphasized 
concerned benefits from a discovery approach and science as a dynamic, 
analytical, flexible way of approaching a pri^blem. The innocuous com- 
. munication was an article on the characteristics of a good test. Both 
communications were approximately 2000 words long and required reading 
rates of 200 words per minute to complete in the allotted time. 

The science attitude mesisure , post-test , used a seVen choice 
Likert-type scale ranging from strongly agree to no opinion to strongly 
disagree. . r 

The pretest and post-post-test were identical. The ten science ;f 

related items used in the; ppst-test were eJiibedded at random within a | 

34 item oplnionnalre including science, mathematics, social studies, i 
. and language arts questions. , - . * 

A computer program using a multiple linear-regression analogy to 
a single-classification analysis of variance was used^ to test the 
three hypotheses. 
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Findings , 

Hypothesis 1 Kull hypothesis one was rejected (p < -OOl).' 
;;enty-one percent of' the variance In the post-test attitude scores 
oiv^e subjects was accounted for by their membership in one of the 
treafenent groups. This large diffexence^proyides a high degrfee of 
confidence in the effectiveness of the attitude cliange treatments . 

Hypothesis 2 - Null hypothesis two cannot be rejected (p - ^.29). 
It can be inferred that the combined role playing experience is no 
more efffedtive than only reading the persuasiye communication. . 

Hypothesis 3 - Null hypothesis' three cannot be rejected (p *f= .84). 
From the results of the test for H2 and H3 It can be inferred that 
there is no significant dif ference^±n the Relative effectiveness of 
the three treatments . 

The three null hypotheses were tested using pretest and post- 
test items. A post-po^t-test wap administered. The magnitude of 
difference between the control group and/J^tie combined treatment 
group was smaller on the pogt-post-test l^ko: on the post-test compari- 
sons. The treatment groups were' 9till significantly different from 
the control group while not significantly different ftom each other. , 



Interpretations ' , • , 

The investigator concluded that attitudes toward science and 
elementary school science instruction can»-be modified by the use 
an attitude-change technique with a persuasive communication as its .^i^ 
basic. mechanism. ' \ 

Two factors were presented that should have contributed to groups 
Roland RBP being more influenced than group R. These factors concern 
the total amount of information to which each subject was exposed and 
the I reiteration of the topics of the communication. A high level of 
acc(6ptance of the ideas of the communication prior to the role playing 
may ^-account for the failure of the treatment factors' to create a d'if- 
fer^nce between groups, 

The results of the study do not support the value of role playing 
as a superior attitude-change technique to reading a persuasive com- 
munication. 



Abstractor's Notes 

The basic component of the study deals with attitudes of pre- 
service teachers toward science and scientists. The total span of 
the study covered forty days. Since the results show only differ- 
ences between control and combilied treatment groups , tjie difference 
may not be the result of the specific treatment, but Ihe result of 
any treatment. Perhaps preservice teachers were reacting to being 
in a treatment group regardless of the treatment. College students 
learn to give answers- that they judge as appropriate to. a specific 
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situation. These students can present, pertain attitudes but liot act 
in accordance with these stated attitudes,, j 

Several possibilities exist to .ge^^ at ^.tt^e .development of atti- 
tudes j First, attitudes and changes ^'n\ att^tj'udes appear to be long- 
range phenomena. What effect might t^ei^e be ^between the four groups 
in excess of ten days?' What teaching^ beihaViors might indicate that 
preservice teachers hold certain attitudes? / Ferhaps role playing 
may be a more suitable method to learn c^r^in teaching strategies 
instead .of attitude development, Mas. attitude tievelopment or rather 
modification of attitude the subject of this study? 

The review of literature had little to do with role playing as 
a technique. It is not clear that the few studies cited dealt with' 
science i No time parameter was, given. |tt attitude change does 
occur, how lasting is this changel Kverf if the. attitude modification 
does occur, what evidence indicates that affirmative action neces^ 
sarily accompanies an .attit-ude shift? Specifically, no studies were 
cited that even alluded to 'Attitude changes of preservice teachers 
^■j-and concomitant teaching strategies^.that'.. reflected these attitudes, 

The sample was 4rawn |rom ^Inly four instructors. No control • 
as to c^-ass room* events was ted. What occurred "in these various 
classes during the forty days of th4 project was not discussed. The 
investigator stated that*, a setting was > elected to make axi experi- 
piental treatment related to science and science teaching seem a 
hatifraX extension of the normal class objectives. Were these classes 
Vequired of elective? If elective, these jstudents may have already 
held more positive attitudes toward science and thus were easily 
subject to attitude modification by any of the three treatments. 
1 . . ' 

Several unanswered q\iestions remain concerning the attitude 
measure used as a pretest, post-test, and post; -post-test. Hov was 
scoring completed?^ Students responded to one of seven points on a 
Likert scale. How were these responses converted to the' scores 
reported such as a mean of 43.33 for group C on the post-test? No 
estimates of reliability or validity were reported. Since the ten 
science items were part of a 34-item instrument, was there an effect 
due to the 24 dis tractors? Sample items should. be reported to give 
a clearer picture of the type of items used. Are ten items suffi- 
cient to gauge attitude modification of «such. a brbad area as science, 
scientists, and scientific endeavors?^ When in the course of the 
academic year was the attitude instrument administered? If this 
study was completed at the beginning of the semester, there would 
be less chance of classroom-instruction variables contributing to 
the test results. ' S 

t. ' ■ '\ ' . ^ 

\ The per suasivej communication was ^composed by the investigatot . 
Was any pilot work done with 'the commi^nication to identify reading 
level and the approprl,ateriess of the 200 words per minute time limit? 
Perhaps a fifth experimental group who did not r'ead either the 
persuasive communication or the innocuous communication would have 
been aprpropriate , The ^ffect of reading any communication could have 
been explored. 
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The onalyalo of data 4oeQ support the Inveatl^ator contention * 
£hat attitudoo toward ocience, ao meaoured by a 10*1 tem Inotrument 
dan^be modified. It dooQ not appear possible to e^ctrapolate as to 
the ultimate affect On the preaervice teacher as mfeasured by per- 
fornancQ with children. Even if the attitude modification does occur, 
does it ctalce any difference? If so, '^low do we Icnow? 

Evidently these students' were not familiar with role playing 
strategies. Hie investigator repotted that he observed very little 
new infoxmajtion or arguments from the role playinfe. If role playing 
was really the focus of the study as stated in the title, perhaps 
activities including role playing should have preceded the study. 

The statement was made thqt the majority of the sub^jects were 
probably much in favor of the point of view cf the communication 
before role playing. Possibly the role playing and the watching of 
the role playing did clarify the content of the coxnmunication. Could 
this same effect be cheated by a discussion among t^e subjects? The 
investigator does address himself extensively to possible factors that 
may account for the failure of the treatment to create a difference 
between groups. ^ 

/ 

The following statement was made that does not appear to be a 
valid generalization from this study as stated. ''The implications 
from" this study suggest that the explicit evaluation of the science«p 
programs in terms of the desired pupil activities and terminal ohjec- 
tives may be a better method of creating the desired science attitude 
in prospective teachers than indiscriminantly presenting science 
teaching materials, science curricula, and science teaching methods 
and hoping that the prospective teacher will synthesize this *infoi.ma- 
tion in such a way as to develop a 'good* science attitude." The 
study as reported does not imply that persuasive communication and 
associated treatments ^wlll aid preserVice teachers in becoming %etter 
teachers. Perhaps placing pre service teachers in public schools as 
part of their training would be more effective in attitude modifica- 
tion toward children than would a contrl-^d role playing situation. 
Perhaps participation in a research project vrould accomplish an atti- 
tude shift concerning science and scientists. The unanswered question 
remains— Even with an attitude modification that is positive toward 
science, will the elementary teacher be more "effective?" It sounds 
reasonable to expect this outcome, but this particular study is not 
addressed to this issue. 

If attitudes can be modified on a permanent basis; if th:: atti- 
tude measure is valid and reliable; if the sample was of sufficient 
size to generalize; if the persuasive communication was specific to 
the attitude modlf icati^on desired; and if the attitude modification 
in fact makes a difference in the ultimate performance of the teacher, 
then the techniques reported may be an aid in teacher training. * > 
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>Ioon» ^Thooao C», "A Study of Verbal Behavior Patterns in Prlnzary 

Grade ClasGrocma During Science Activities." Journal of Research 
in Sdlence Teaching > Vol. 8, ^. 2:171-177/ 1971. 
Descriptors—Educational Hesearch» *Elementary School Science, 
•^InservicG Teacher Education, *Interaction Process Analysis, 
*Que8tionlng Techniques, Teacher Education, Verbal Gqimiiunication 

Expanded Abstract and Analysis Prepared Especially for I^S.E. by 
Lowell Bethel, University of Texas at Austin. 



Purpose 

• The purpose of this study was to select and analyze verbal 
behavior patterns of both, elementary school pupils and teachers while 
they were participating in classroom science activities. 

The six hypotheses tested in 'the study are? There will be no 
significant differences in \ 

1. the tea'chers* l/D ratios during science activities, before 
and after the introi^uction of SCIS teaching methods and 
materials . /\ ^ ^ 

2. the percentage cf cime teachers spend talking during science 
activities, before and after the„ introduction of SCIS teach- 
ing methods and materials. 

.3. the percentage of time students talk during science activities 
before and after the introduction of SCIS teaching methods 
^ ^nd materials . 

4. the' percentage of continuous student comment during science 
activities, before and after the introduction of SCIS teach- 
ing, methods and materials . 

5. the kinds of questions teachers ask children, before and after 
the introduction of SCIS teaching methods and materials . 

6.. the teachers* cojoprehension of the process as^pects of science, 
before and after the introduction of SCIS teaching methods 
and materials. 



Rationale 

Many of the new elementary science programs require cHanges in 
the traditional role of the classroom teacher if they are to b^ suc- 
cessfully implemented^ One method for determining the degree of role 
modification after introduction of a new elementary science pr^ram / 
(SCIS) is to examine samples of verbal behavior patterns used during 
classroom science activities. Basic assumptions underlying this 



study . are : 
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i* Verbal behavior selected for analyoio io an adequate sampli * 
of the toachero* total clasorooaj behavior. 

2. The quantity and quality of teachers * ^^It largely determine 
student reactions. 

3. The types of questions asked by teachers determine the 
quality o-f teaching and the level of thinlcing being stimulated. 

• 4. The lessons observed are representative of the science lessons 
taught daily during the study. 

The investigator does cite previous litrerature to which this ^ 
investigation is related (i.e. Fischler and S^sta«iow, Snyder). 

Research ^iDesiRn and Procedure 

The sample consisted of 32 primary grade teachers locat;jed in five 
Michigan school districts (DeWitt, East Lansing, Grand Hedge, Lainza^, 
burg, an^d Williamston) . Sixteen of the teachers used Science Cur- 
ric ulum Improvement Study (SCIS) methods and materials while the 
reniining 16 teachers used conventional science methods and activities. 
In addition, the 16 teachers using the SCIS methods and materials had ^ 
been participants in a three-week SCIS worltshop the previous summer 
during the Investigation. This was the primary experimental variable>^ 

Tape recordings of the verbal behavior of the 32 teachers were 
made during each science lesson observed from April 22, 1968, until 
March 27, 1969. Formal observation of the 16 SCIS-trained teachers 
began prior to their summer in-service workshop. The SCIS-trained 
teachers were observed five times and the convent^Lonal teachers were 
observed only two times. The recorded samples of verbal behavior were 
analyzed and evaluated using three instruments: Tlie Flanders System of 
Interaction Analysis, tfee Science Teaching Observational Instrument, 
and the Science Process Test for Elementary School Teachers. The 
Process Test for Elementary School Teachers was a written test and 
was administered to the 16 SCIS-trained teachers prior to the partici- 
pating in the summer workshop and again at the conclusion of the study. 

The Flanders System of Interaction Analysis x^as used to obtain 
data to test the first four hypotheses. Jlie Science Teaching Obser- 
vational Instrument was used to evaluate the data to test the fifth 
hypothesis. The Science Process Test for Elementary School Teachers 
was used to evaluate the data to test the sixth hypothesi^. 



Findings 

The findings of the iiivestigation were as follows: 

1» No significant differences were found bej^ween the two 

teacher groups in their I/D ratios, percentage of teacher 
t&lk, percentage of student talk,, and percentage of con- 
tinuous student comment during science activities. 
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2. Ko oignificant differences were found between the initial 
oboervation and final observatiion of those teachers employ- 
ing the coitventional science activities in regard^ t^ l/D 
ratios, percentage of teacher talk, percentage of , student 
talk, and percentage of continuous ^student CGimaent during 
ocicnco activities. 

3. A significant difference was found in the l/D ratios of the 
16 SCIS teachers as measured before and after the intro- 
duction of the"" new science methods and materials . 

4. No significant differences were found in SCIS teachers* 
percentage of teacher talk, percentage of student talk, 
and percentage of continuous student comment during science 
activities. ' . ^ 

5. A significant difference was found \n the types of questions 
SCIS-tralned teachers asked their students. 

6. No significant differences x^ere found in the SCIS- trained 
teachers* comprehension of the process aspects of science 
do measured by the Science Process Test for Elementary 
School Teachers . 



Interpretations 

The l/D ratios of the SCIS teachers did improve significantly. 
However, the researcher noted that the l/D ratio of the SCIS teachers 
decreased over time, indicating that they moved from a "taore indirect 
verbal behavior pattern apprpach to a more direct verbal approach 
during si^:ience activities. A possible reason offered for this decrease 
was the materials-centered approach of the science program. Directions 
given to pupils for working xrLth the materials tended to depress the 
I/D ratios. An indication of this was ,the increase in the percentage 
of dire^Abn-givlng (Flanders Category 6) . 

TWe SCIS teachers did significantly shift from askin^low-order 
uootioiA t3^es (requiring little cognitive skill) to higher cfrder 
'quootiyno ^requiring higher cognitive efforts) . 

^ Abstractor's Notes ^ 

Many o£ the new elementary school science, programs emphasize 
inquiry skills including oral communication skills, jc^he role of the 
teacher in providing a classroom environment which fosters the develop- 
ment of these skills is critical. Thus, change in the traditional 
role the classroom teachers is required if these goals are to be 
realized. The investigator in this study attempted to assess the 
effect of in-pervice experience and new science materials in modifying 
the traditional rdle of the conventional teacher by evaluating samples 
of teachers* and pupils '^verbal behavior. However, there appears to 
be a paucity of literature* to support this viejw a? cited in the inves- 
tigation. Although space limitations considered j there should be 
more information given in this area. No sungaaries of the research 
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literature are made. Thus, it is difficult foi^ the reader to assess 
the importance of tho investigation outcomes. 

%A major shortcoming of the study is the description of the sample 
tested and the san^ling procedures* No indication is given as to how 
t;he teacher samples were chosen and why. In addition, the teachers 
chosen are so poorly described that it might be difficult, if not 
impossible, to replicate this study because of this shortcoming. Such 
factors as race, age, years' of teaching ejcperience, and sex, as well 
ao other sample traits are omitted. Further, the investigator does 
not describe the pupil sample as used in the investigation. There is 
no mention of age, grade-level, se3c, or socioeconomic level. These 
variables are all critical to replication and are conspicuously absent. 

V Although the instrumentation used, in the investigation is ade- 
quately described, no mention is made of reliability and validity 
coefficients. A reader' unfamiliar with the instruments would be a*t 
a loss to determine their true value as used in this investigation. 

Procedural biaajooay have been introduced by taping the control 
group only two times'T^ Would there be a problem in randomly obtaining 
sufficient verbal behavior samples /or analysis and evaluation? 
'Perhaps this should have T^jflgji considered. The investigator 4id not 
describe how he selected sUples of verbal behavior for analysis and 
evaiuation. How is the reader to know if there is any sampling bias " 
here? 

The results are presented in a concise^ manner . The written 
description is consistent with the data and inferences are kept to a 
minimum in the. results. The analysis of data collected using the 
Flanders Inte^raction Analysis System does present some problems as 
the investigator admits. He does offer suggestions for l)rocedure8 
and methods to be used in future studies involving science classroom 
activities. Although the results of this investigation revealed that 
the verbal behavior patterns of teachers are .altered after the intro- 
duction of SCIS teaching methods and materials, little or no change 
in the verbal behavior patterns of the children occurred. 
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Raymond i Anne, "The Acquisition of Nonverbal Behaviors by Preservice 
Science Teachers and Their Application During Student Teaching." 
Journal of Research in . Science Teaching , Vol, 10, No. 1:13-24, 
1973. 

Descriptors — ^Classroom Communication, *Educational Research, 
*Methods Research, Microteaching , Nonverbal Ability, *Preservice 
Education, *Science Education, Teaching Techniques, Verbal 
Ability 

* 

Expanded Abstract and Analysis Prepared Especially for I.S.E. by 
Glenn H* Crumb, ftestem Kentucky University. 

Purpose . 

Research conducted over the past several years^has .consistently 
identified silence or nonverbal teacher-student interaction as an area 
needing' better description, analysis and study. There are, however, 
only a few studies reported that attempt to analyze the effect of pur- 
posefully designed teacher training activities upon nonverbal inter- 
action ±n the classroom. . « 

The study by Raymond investigated the effect of training pre- 
service teachers enrolled. in a methods course to identify and practice 
the skill of usir^ ^nonverbal cues and silence. The parameters ana- 
lyzed included "time devoted^ to nonverbal behaviors, time devoted to- . 
congruent behaviors with students , nvimber of positive nonverbal inter- 
actions and, pupil percepti^ons of effectiveness of the student teacliers 
who have completed the brief training program on nonverbal behavior. 



Rational e 

. . V 

It is an underlying assumption of this and other research studies 
on "micro-criteria" that teachers who are able to exhibit each skill 
in a set of teaching skills (at or beyond the criterion leyel) will 
display greater overall teaching effectiveness than those teachers 
whose training does not concentrate upon the development of one skill - 
at ^a time. In this research the use of nonverbal cues and silence 
during teacher-student classroom interaction was investigated to deter- 
mine the effect of training of preservice teachers in using this skill. 
Although considered separate and apart from other teachef skills, the 
data collected and analyzed included both verbal and nonverbal inter- 
action^ (classed as positive, negative and neutral) observed over the 
same period of time within the classes being taught by the experi- 
mental subjects. '-0 



Research Design and Procedure 

The sample for the study consisted of twenty preservice secojidary 
school science teachers, selected from a group of thirty-one who were 
enrolled in the mj^cro-teaching experience during the fall and winter » 
terms, 1970-71. The twenty subjects randomly assigned to the treatment 



ERIC 



26 

32 



and control groups x^ere those preservice teachers who elected to 
complete their student teaching requirement during the winter or 
spring terms, ^971. 

All subjects were exposed to the concept of micro-teaching via 
lecture and discussion conducted by the researcher ^t the initial 
meetitje of a one-credit hour seminar. For each micro-lesson to be 
developed, the specific teaching skill (e.g. set induction) was iden- 
tified, then a filmed model lesson was presented. Following this 
presentation, questions were answered. Then the subjects were asked 
to prepare a science lesson to focus upon -the particular skill. 

During the research phase, both the treatment and control groups 
developed and taught micro-lessons designed to develop the teaching 
skills' of "set induction" and "using probing questions, according 
to a prepared schedule. The treatment group developed and taught a 
third micro-lesson designed to develop skill in use of nonverbal cues 
and silence. For each micro-lesson the procedure for developing the 
skill- was the same; the skill was identifipd by lecture and discussion, 
a model was viewed on film, a second discussion/question period 
followed, micro-lessons were prepared and taught, micro-lesson critique 
by students^ Jind supervisor followed, thenJ "reteach sessions and suU- 
seauent critiaues ensued until criterion level performance was reached. 
For the first micro-lesson, >peer preserviHie teachers served as students 
and all sessions were videotaped. For the second and third micro- 
lessons, volunteer public junior high schooJ^students who were in 
study hall in the last two periods qf the day served as students for 
the mlcvfo-lessons . * 

During their student teaching experience, following the training, 
each preservice teacher in the study was asked to select a class for 
videotaping. The tapes were used for analysis of teacher behaviors 
using the B iology Teacher Behavior Inventory (BTBI) . An int^observer 
reliability of "si was finally re^ed after initially finding agree- 
ment "too low to establish ^reliability ." 

The Quality of the teacher-student .nonverbal interaction was 
analyzed from classroom observation data collected by three cooperating 
teachers and the researcher using the % ALL . The % ALL was modified 
to allow coding of the initiating source- of the- interaction . Inter- 
^^coder agreement of .79 on the Scott Index of Inter Coder Agreement was 
reached after training. ; 

Data on student perception of teacher effectiveness were collected 
immediately following the videotaped lesson conducted by the student 
teacher. This was accomplished by having the members of the student 
teacher's- class complete the Teacher Dem onstrations Rating Form. 

The record cf the videotaped teacher-student classroom inter- 
•actions was encoded using the BTBI. according to the various forms of 
expressed behavior including "verbal." "nonverbal." "congruent, and 
"contradictory." The number of seconds of each behavior evidenced by 
, thte teacher was recorded and <hese data converted to percentage of 
total behavlof observed during the class period. This procedure was 
used in order to allow comparisons of teacher datk collected for class 
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period of different lengths. Contradictory 'behavior , as a category/ 
was subsequently eliminated due to low incidence. 

Findings 

The percentage of time that each teacher displayed nonverbal 
behavior was ranked across both groups and the Mann-T^/hitney U was 
applied to test the significance of difference in the amount of non- 
verbal behavior displayed. .The hypothesis that "teachers who^ave 
identified an'd:practiced the skill of nonverbal cues during a methods 
cl^ss will devote significantly more time to nonverbal behaviors during 
the student teaching experience"' could not l^e rejected at thej^^^OO^^el . 

The percentage of time teachers spent"ext>;^ressing behaviors con- 
gruent with students was also calculated,, ranked acro§^ groups and 
subjected to the Mann-Whitney U Test. The hypothesis that the treat- 
ment group would display significantly more time'* to congruent behavior 
with students was rejected. 

The teacher-student ^interaction patterns resulting from the appli- 
cation of the 7. ALL was analyzed by determining the average number of 
observed incidences of teacher-initiated, positive, nonverbal inter- 
•^-^ctions. These means were subjected to a t-test and found to be 

sigTiificantly different beyond the .05 level. The difference between 
the means favored the treatment group. 

* . . a * . " 

The swres of teacher effectiveness as indicated by the secondary 
school pupils in the classes taught by the student teachers were 
tabulated and the mean scores for the two groups subjected to a t-tesf 
for significance of differerjca in means. The calculated value of t 
. ^was reported to He '»not significant" at the .10 level and the hypo- 
thesis that the treatment group teacher would "be perceived as more 
effective" was rejected. 

Beyond the results related to the hypotheses tested, the following 
findings were reported: " 
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A significant positive correlation existed between the ranking 
of incidence of teacher initiat^ nonverbal positive interJL. 
action and the rank order of /Ifeadher effectiveness (.05 levm 
on Spearma^i Rank Correlation). 

Th'e coijtrol group devoted significantly more time to verbal 
interaction accord-f-g to the ^TBl classification. (1) "states 
knowledge"— verbal; (2) "States knowledge"— congruent ; and 
(3) "Shows knowledge" — congruent. The treatment group spent 
significantly more time in "positive af f ectivity"— non- 
vsrbal • (Mann-Whltney U) . 

Nonverbal teacher behavior for the treatment group made up 
44 percent of their time while control group teachers devoted, 
34 percent of their time to nonverbal behavior. 




Interpretations 

The conclusion reached by the researcher was that "the experimental 
group of teachers differed significantly In their classroom behavior 
from the control group." The experimental group "devoted more time to 
nonverbal behaviors, including more teacher-initiated, positive non- 
verbal interactions with students in their classrooms," and had a 
larger total amount of time "spent in nonverbal positive af f ectivity ." 
A further conclusion was that a causal relationship existed between the 
number of positive, nonverbal inteijactlons which a student teacher 
initiates and one measure of teachea^ effectiveness. 

In summary, the study ' indicated that inclusion of identification 
and practice in the skill of using nonverbal cues and silence in the 
preser«7ice education of science teachers leads , to more use of positive 
nonverbal interactions wi-th students in Lhe classroom during student 

fching. ,From this finding,, and the significant positive correlation 
orted between the number of positive nonverbal interactions and 
dent perceived teacher effectiveness, there seems to be an impli- 
ion that such preservice training improves teacher effectiveness 
oxiu that such training should be included in science teacher education 
programs. 

/i 

Abstr^tor's N^t: 

Because of the space limitations placed upon any author of an 
article for professional journal publication a reviewer frequently has 
a host of perhaps trivial, nagging, unanswered questions regarding 
research design, procedure and the collection and L^.eatment of data. 
In most instances a review of the total research report and/qr an 
interview with the author can retire most concerns or at least reduce 
them to an understandable Vreal world of operations" level. Neverthe- 
less, questions concerning sample size and selection as well as treat- 
ment group versus control group experiences must be carefully scrut- 
inized before generalizing too far afield from this reported study. 
The reported data collection and reduction procedures /used in this 
study leave some unanswered questions about inferences based upon obser- 
vations of behavior for the treatment and control groups. For example, 
was the exposure time to school students during the training period the 
same for the treatment and control groups? Did the control group 
develop and teach a "placebo" lesson to public school students to off- 
set the treatment group's contract during the micro-lesson use of non- 
verbal cues and silence? What other experiences prior to student 
teaching may have been different due to scheduling- treatment and con- 
trol group member^ip separately? 

It is ob^rved that no mention was made in the report of the 
specific period of time over which the data were collected regarding 
teacher-student interac^ons using the % ALL . If sampling techniques 
were used, what were the^ and how were they determined? In addition, ^ 
what method was used to determine how cooperating teachers and the 
researcher were assigned to collect data? Did they know which student 
teachers were in the treatment and control group^espectively? 
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If the above questions can be effectively neutralized, then It Is 
appropriate to examine the data and its statistical treatment in search 
for "hard evidence" in support of rejecting or not rejecting the stated 
hypotheses as well as for implications and inferences for teacher edu- 
cation at large. ,^ased upon the tabular presentations in the report. 
Table III, one is struck by the fact that on the' average, the treatment 
group members displayed 'slightly less than five positive nonverbal 
teacher-initiated interactions and the control group teachers each 
averaged slightly over two. This is an exceedingly small number of 
incidences of observed behavior compared ,to the quantity of other 
behavior logged. For examlple, about 100 incidences of nonverbal 
behavior were recorded fbr each of the treattnAnt- group smd control 
group. A "quick and dirty" analysis of the data in Table III leads to 

•* the observation tha% 70 percent of the control group teachers' initi- 
ated nonverbal behavior was neutral, while about forty percent of the 
treatment group teaihers' initiated nonverbal behavior was neutral. It 

i is also observed that about thirteen percent of the nbijverbal behavior 
of the treatment group was negative whereas only four percent of the 
control group teachers' initiated nonverbal interactions fell into this 
category. This seems to suggest that whereas the report of the study 
focuses only upon nonvetbal positive behavior, it may be advisable to 
examine the data in Table III for inferences regarding other types of 
nonverbal behavior as well. Such inspection might well include the 
ninnber of incidences of behavior logged in both the verbal and non- 
verbal categories, as well as total incidences of behavior categorized 
by the cooperating teachers and the researchers. Although the data 
presented in Table I lead one ,to the conclusion that the treatment 
group devoted more time to nonverbal behaviors, the data in Table III 
indicate that the treatment group displayed about twice as many 
incidences of verbal behavior as did the control group while only a 
small difference was found between the two groups in the total number 
of incidences of teacher-initiated nonverbal behavior . 

In addition to problems associated with the teach er-init?ia ted 
behavior as presented in Tables I and III, there is^ the one of inci-* 
dence of student-initiated interactions. The number of incidences qf 
student-initiated interaction favored the treatment group (about 250 
to 330) . Most of this dif ference between the two f i^equencles of 
student-initiated behavior can be attributed to the large amount of 
student-initiated verbal behavior on the part of the members of the 
classes of the treatment group. Could this have some implications for 
the reported finding "th^t the number of positive, nonverbal teacher- 
initiated interactions correlates significantly with the student's per- 
ception of teacher effectiveness?" Could it be that the greater 
incidence of student-i,nitiafied verbal interaction implieo something 
about the quality of teaching effectiveness as perceived by the students 
in the classes of the treatment group teachers? 

The extended analysis of the behaviors of the two groups as cate- 
gorizeid, by the Biology Teacher Behavior Inventory does not provide 
sufficient information for reader analysis and evaluation. The state- 
ment that the computation of the U values 'H?as hampered by the fact 
that 'tied data' interfered with the assignment of meaningful ranks to 
the teacher" is weak and indicates limited understanding of the 
statistical procedures for, and limitations and implications of, using 
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the Mann-Whitney U as a statistic- Table VIII provides aA interesting 
inference when coupled Vith the narrative. It would lead one to 
believe that the control group talked more, "States Knowledge, when 
the observations logged and reported in^ Table III indicate that the 
number of incidences of teacher-initiated verbal interactions are 
higher for the treatment group in the ratio of about 2:1, 

In order to place ' the above comments into proper perspective it 
is necessary to examine the research scope and intent- Although not 
stated explicitly in the Journal-, it is presumed that tk±3 study was 
conducted as part of a degree requirement- As such, it should be per- 
ceived to be a very fruitful undertaking- It presents some findings 
worthy of consideration and points out some quite obvious Tioles^ in 
tesearch and practice in teacher training in general and science 
teacher training in particular- The shortcomings cited are m^Ait to 
be flags for those who would accept, at face value, the results pre- 
sented in the article. The limitations 6f the study are, such as to 
deem such action unwise. Further research on this topic does seem 
warranted and this work of Raymond certainly provides a substantial 
contribution toward its initiation. 
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Renne, Thomas, Heidi Kass and Marshall A. Nay, *'The Effect of 

Verbalizers on the Achievement of Non-Verbalizers in an Enquiring 
Classroom." Journal of Research in Science Teaching, Vol. 10. 

No. 2:113-124, 1973. " 

Descriptors— *Academic Achievement, *Behavioral Objectives, 
Discovery Learning, Educational Research, *Leaming Processes ^ 
^Questioning Techniques, Science Education, ^Student Charac- 
teristics, Verbal tonpunication » 

Expanded Abstract and Analysis Prepared Especially for I.S.E. by 
Jal S. Parakh, Western Washington State College. 



Purpose 



The general purpose of this study was to obtain some knowledge 
of one type of classroom verbal interaction, namely questioning. 
Specifically, the following questions were investigated: (1) Do 
participants and non-participants differ in the extent to which they 
benefit, as measured by an achievement test, from the verbal queries 
of the participants? (2) What are some of the characteristics of 
students who tend to participate in asking que'stions, including seS, 
I.Q,, sociometric rating, academibmetric rating, and reading level, 
as opposed to those who do not? (3) What effect does guidance have 
on achievement, the degree of partirlpation, and the nature, quantity, 
and quality of the questions asked? 



Rationale 



The desirability for students to learn\cience by askiiig questions 
holding discussions, and interacting with each other is often mentioned 
in the science education literature, but there is a shortage of specif i 
knowledge as to how effectively the students learn from one another. 

While classroom enquiry and verbal interaction entail more than 
just question asking, the, stiidy was limited to questioning. The basic 
assumption is that ques/Sioning^is an important aspect of classroom 
enquiry. More specifically, the Suchman questioning technique was 
adopted for this study because it enables a certain degree of experi- 
mental classroom control and involves student-initiated questioning, 
which is generally considered desirable. The intent of this investi- 
gation was not to study the process of enquiry as definedf-by Suchman 
but some aspects of the questioning mode of verbal interaqtion. 

Suchman found that children who received enquiry training became 
more fluent in questioning, asked more analytical questions and- fewer 
iibstract conceptual questions, particularly of the diffuse type, than 
did the control group. 



However, when only some of the students ask questions, to what 
extent do the remainder benefit and to what extent are the verbal 
probes of the questioners taken as cue^ by the others and how is sci- 
ence achievement influenced? Gallagher found significant differences 
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favoring expressive boys over non-expressive boys on both a teacher- 
made high school biology test and a BSCS Test . The same trend was 
noted for girls but the difference was not statistically significant. 
Boys contributed more to verbal activity than girls and expressive 
students were found to have a higher mean aptitude score than non- 
expressive students. 

., ' ^ 

The amount of direction of the enquiry process desirable for con- 
cept development has been investigated among others by Craig, Kittle » 
Butts» Scandura, Butts and Jones » Tanner and Thomas and Snider. (No 
statement of finding^ of these studies was reported in the article.) 



Research Design and Procedure ^ 

Twelve randomly selected grade VIII science classefa were in turn 
assigned at random to three treatment groups of four classes each. 
The initial supply of information was the same for all classes in all 
treatment groups and was provided by one silent 8mm film loop (unnamed) 
in the Science Research Associates Inquiry Development Program series. 

Treatment A, or the guided enquiry trejatment, consisted of the 
film, a test based on the film content whicfe was called a pretest and 
was also considered to act as a guide for the subsequent enquiry 
session » and then the same test was readministered after the enquiry 
session and called a post test (emphasis added). 

Treatment B, or the unguided enquiry treatment ji ^as not to 
administer the pretest or guide, but to engage in the enquiry session 
immediately after the film was viewed and then to take the same post- 
test as Treatment A. 

Treatment C, or the self -guided enquiry treatr^ent, cons i-s ted of 
the film followed by the students being asked to submit five written 
questions answerable by **yes" or %o," then the enquiry session, and 
the same post test as in A. The writt^ questions were also used to 
identify the enquiring and non-enquiring non-participants. 

^ ' Three enquiry sessions based on the same ^hree (unnamed) films 
were conducted with each class Ijy the principal investigator in three 
subsequent meetings. A Sample enquiry session was conducted with ^each 
class prior to the experimental treatments. * During the first three 
meetings with each of the classes, the following were administered: 
the Lorge Thomdike Non-Verbal Test, Level 4, Form A; the Gates Read- 
ing Survey, Form 3; a sociogram; and an academiogram. In addltioU, 
an estimate of the extent of questioning in ^^ich each stud^t engaged 
under normal classroom conditions ^^s obtained by asking eachf teacher 
to rate his students on a five point scale, ranging from one for a 
person who ^rarely asked questions to five for a person who asked many 
questions. 

^ The three achievement tests were based on the content shown in 

^^the three films . Each of the three; ten-item tests had the four-option 
multiple choice format. Of the 30 items, 14 items .were judged to be 
at the two lower levels of Bloom's Taxonomy (knowledge and comprehen- 
sion) and the remaining 16 items were judged to be at the four higher 
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levels of Bloom's Taxonomy, Three achievement scores » namely, totSi 
achievement* achievement on the two lower levels and achievement 
based on the four higher levels of Bloom's Taxonomy, were separately 
used in the .analysis. The KR-20 reliability of the composite achieve- 
ment measure was 0,76, 



Findings 

The* major findings are listed below, ' 

a. Significant Spearman ranlc order correlation coefficients » 
weie found between the classroom teacher's rating of 
students' Verbal participation and the numbjar of questions 
asked In the enquiry session in 11 of the' 12 classes. 

b. In all treatment groups the mean I.Q, of the participants 
(students who asked at least oiie <|ue8tioxi per enquiry session 
or a minimum of three questions) expeeded that of non- 
participants (students who asked less than three questions). 
With I.Q* controlled, the mean achievement of participants 
was high^ than that cf^ non-par tic ipanfcs but significant 
differences were found only at the four higher level cate- 
gories of Bloom's Taxonomy. 

c* With adjustments for ability, the highest mean achievement 
^ in both the composite and higher level categories was . 
attained in the group that was given the pretest and called* 
the guided enquiry group or Treatment A. 

d. Sex and participation were found to be dependent upon each- 
other with more boys and fewer girls participating. The 
Ghi Square test was used. 

e. Participation was found to be independent of sociometric 
position in the classroom but not o£ academiometric position. 

f. The number, proportion and cumulative proportion of questions. 
' " were also classified according to the Suchman* categories. 

The Kolmogorov-Smimov two-sample test was used to test ^ 
for significant differences iji distribution along the cate- 
gories. At the 95 percent level of confidence, the queption 
distributions were significantly different l^etween treat- 
ments 'A and B anjd between B and C but not- between A and C. 
Tt was not possible to make statistical adjustments for the 
• differences in I.Q. sii^ce the frequency distribution' for 
fluency of verbal enquiry departed greatly ^from normality* 
Of 'all the questions asked by each group. Verification 
questions accounted for 54 percent in treatments A and C 
and 60 percent m treatment B. In terms of Implication 
. questions, the sub-categories of diffuse questions accounted 
, for 5.2 percent, 8.9 percent and 4.5 percent fof treatments 

A, B,'and C and the sub-category of directed Ijuestions 
accounted, for 19.8 percent,, 8.1 percent and 17.5 percent 
for treatment A, R and C respectively. Both groups A and C 
asked many mor^ specific cause and effect questions than did 
group B. 
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Interpretations 



From the data in the otudyit appeared that student verbal partici 
pation in the classroom was associated with achievement at the higher 
cognitive levels of SloOm's Taxonomy, 

Guidance appeared to have little effect on achievement at the lower 
levels of Bloom's Taxonomy. Guidance did appear to result in better 
achievement on the higher levels and the composite score. 

"Partldipants tended more often to be boys. This may be due to 
social reasons orto a less adequate content background . ^ 

Participants had higher I.Q.s than nonn;participants . , ^ 

Socioiaetric choices did not appear to be based upon academic 
achievement in the classroom. 



Abs t ractor's Notes 



After reading the article the abstractor was hard-pressed to find 
a clear statement of just what effect(s) verbalizers had on the achieve- 
ment of non^erbalizexs in an^senquiring classroom. If 'the major pur- 
pose of the istudy, ^mpl±eC&y the title of the paper and portions 
^ of the statement of the problem by the authors, was to determine the 
effects of participants on non-participants then that purpose seems 
not to have been achieved. 

A second purpose of the study was to determine some of the char- 
- acteristics associated with participation. The considerable amount 
of data collected pn the students' characteristics such as their sex, 
I,Q,, ability level, aociometric and academiome,tric position, etc., 
and subsequent analysis must indeed have required,, considerable effort 
and time. The findlng;s were probably thOsfe that many, if not most, 
experienced classroom "teachers would have predicted and thus the study 
servecl thd Important function of providing research evidence for expec- 
tations o£ experienced teachers (or at least the abstractor's assump- 
tion about predictions of experienced teachers) . 

A third purpose of the study was to investigate the effect of 
what the authors termed "guidance" on achievement as well as degree 
and nature of participation. In the article about 'half a dozen 
studies are simply listed as .investigations related to the amount of 
^ direction provided in the enquiry process. Thus one is unable to 
assess from the article whether "guidance'* as -used by the authors was 
the same as that used by the other investigators or something similar 
or significantly different. Guidance a» described by the authors was 
of two t3^es; (1) in treatment A, a pretest ^as administeored after 
the film but before the enquiry session and (2) the students in tre^ 
ment C were asked to submit five written questions after seeing th^^ 
film but before the enquiry session. 

Some questions remain in the niind of the abstractor. Since the 
same jiest was used as a posttest, how much of the gain in achieve- 
ment was due to taking the pretest and how much due to. the participation 
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in the enquiry sessions? To what ext^t were the questions of students 
in treatment A directed-to ^finding the answers to *the questions on the 
pretest? 

The above comments and questions refer pifimarily to the sta^iement 
"Of the problems and the purposes of the study . A number of questions 
occur regarding the procedures used. In the article it is stated 
that twelve grade VIII science, tlasses were randdinly* selected . What 
was the size of the population from which the sample was selected? 
The enquiry sessions were based on three 8imn film loops in the SRA 
Inquiry Development Program. What were the names of the films? What 
was the nature of the pilot test that was used to determine the suit- 
ability of the films.? What are the specific objectives of the films 
shown? Are the films suited primarily to develop enquiry skills as 
defined by Suchman ^d secondarily to develop concepts through enquiry? 

In raisinc the questions and making the comments listed above, the 
abstractor recognizes that constraints upon the authors in terms of the 
length of the article may have been responsible for the absence of some 
or all of the information that would have constituted a more complete 
report; of the study. Thus, the questions and comments should not be 
misconstrued. It seems clear to the abstractor that considerable effort 
Was expended on the study and interesting findings have been reported. 
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Uiban, David M., tho Ability to Infer Daficiency in IlathonaticG 
From Performance in Phyoico Using Hierarchieo . " Jouroal of 
^search in Science Teaching > Vol, 8, No. 1:67-82, Augxict, 1973/ 
Deocriptors — *Evaluation, Inotruction, Hathematico , *FhysiCEf^^ 
*Progranisd Instruction, *E2i[iedial Inotruction, Secondary School 
Science 

Expanded Abstract and Analy.Qio Prepared Eopecially for I.S.E. by 
Robert L. Steiner, The Ohio State Univeroity. 



Purpose 

The purpose of the otudy was twofold: 

1) to develop and evaluate a syotem to infer deficiencies in 
requisite mathematics abilities of students enrolled in high school 
physics, and 2) to Investigate Jthe effect of remedial instruction in 
requisite mathematics abilities on physics achievement. 



Rationale 

The absence of certain mathematics abilities has been shown to 
adversely af f act J.eaming in science (Redfield and Atwood, 1966); 
therefore, poor student achievement in physics may be based in part 
on unlearned^ requisite mathematics abilities. Identification of 
hierarchies and individual, diagnosis of requisite abilities has been 
successful in experimental work; however, to be practical for common 
use, some statistical diagnosis .of absent hierarchical abilities tor 
normal classroom work will have to be developed. If a number of 
physics problenis are analyzed and hierarchies of mathematics abil- 
ities necessary for successful completion of each problem -arc prepared, 
it should be possible to Infer student deficiencies in* mathematics 
abilities from individual performance patterns of correct and incor- 
rect solutions to the problems. 



Research Design and Procedure 

An accelerated 11th grade physics class (n ° 13) and a 12tfi grade 
college preparatory physics class (n = 17) were used in the study. 
Students in each class were randomly assigned either to experimental 
(n = 16) or control , (n =» 14) groups. Both groups received a 10 week 
programmed instruction unit on light. The unit uas divided into 4 
sections and consisted of 267 frames, 87 of xjhich were designated as 
hierarchy frames. In the 87 hierarchy frames, 163 requisite mathe- 
matics abilities were identified and remedial sequences were developed 
for 42 of the abilities occurring most frequently. The experimental 
group received remedial mathematics instruction during sections 3 and 
4 of the programmed materials. The decision to remediate a student 
was made on ^rror rates on each of the 42 mathematics abilities. 
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Student achiovGDGnt in phyoico waa Goaaured by PSSC couroe teoto. 
Two PSSC tooto bad boon adnlniotered to tho claooeo at appropriate 
lafarvalo prior to tbe otart of the 10 weeko of progranissd Eatorialo^ 
PSSC Tooto Hi and IV were adnlniotGred aftor oectiono 2 and 4, 
reopectiVGly. Aloo, critical thinking ability wao tiaaourcd uoing tho 
Watoon GlaoGr Critical Thinking Appraioal (WGCTA) . The deoign of the 
otudy lo illuotratcd in the following figure. 



ProgroEXied llatorialo 
Total Franeo 
Hierarchy Froneo 




Pt 1 
78 
22 


Pt 2 
73 
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Pt 3 Pt 4 
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Phyoico achioveEent for both groups ao Ecaoured by PSSC Teoto" III 
and IV wao 'found to be oignificantly higher for the Eiaterlalo preodntod 
using the programmed inotruction format than for the prior inotru^- 
tional method. The experimental treatment (remedial mathematico 
instruction) did not produce oany oignificont differences in the physics 
achievement of tho experimental and control groups ao measured by PSSC 
Test IV. The gain on the WGCTA wao significant. 

No significant differences between the experimental and control 
groups in physics achievement of FSSC Test IV were fotmd when covari- 
ateo including the WGCTA pretest ocores, PSSC Test III Scores, WGCTA 
gain scores and number of physics program cards completed were used. 

Fif ty-si^Cpercent of the 82 remediatlons prescribed during the 
otudy were judged as misdiagnosed ; therefore an ex post facto exami- 
nation of the diagnosis procedure was undertaken. Performance pat- 
terns represented below were developed for each student on each of 52 
mathematics abilities identified for part 1 and 2 of the materials. 
A plus was recorded in each of 

the quadrants where the par- Part 1 . P art 2 

ticular performance indicated 
wao exhibited at least once. Correct 
All sixteen of the possible 
arrangements of + and - were Incorrect 
exhibited in the 1456 indi- 

vidual patterns (28 students x 52 mathematics abilities) . The 16 
possible patterns were classified in the following manner: 

ip 

1) no change (2 patterns) consistently correct or incorrect 
responses on parts 1 an^ 2. 

2) inconsistent with valid measur-e (4 patterns) incorrectly 
responding on part 2 after correct responses on part 1. 
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3) consiotent \rith valid measure, incidental learning (2 
patterno) reoponding correctly ^on part 2 after incorrect 
responses on part 1. 

4) no^ data (8 patterns) no clear information on student per- 
formance trends. 

Using all student patterns, the observed frequency of + and - 
for each quadrant was. calculated and the probability of each of the 
16 patterns calculated. A chi-square test of predicted and observed 
numbers wao made for each of the 4 categories of performance patterns. 
The test was significant at p<.001, indicating that the patterns were 
not simply a collection of random data. Next, 28 of the 52 mathe- 
matics abilities were removed from consideration because of unequal 
or limited distribution in' parts 1 and 2. Performance patterns for 
the remaining abilities were calculated and a chi-square test of pre- 
dicted and observed numbers was maye. The result was again signifi- 
cant, indicating that the performance pattems^eould be used to 
measure deficiencies in mathematics abilities. The performance pattern 
method was improved even more when only 11 mathematics abilities with 
BiQXlmum independence were used. Had performance pattemE? been used 
for making remediation decisions, instead of error^ rates as used in , 
the study, incorrect remediations would have been reduced from 56 to 
35 percent . 



Interpretations 

Failure of the experiment to improve physics achievement by remed- 
iation in mathematics abilities may have been related to an ineffective 
system of diagnosing the need for remediation. Perhaps diagnosis based 
on a tally system end performance patterns would have yielded different 
results. 

♦ 

Two performance patterns of cons"istent measure but indicative of 
incidental learning were much higher than anticipated and incidental 
learning was four times as important a source of mathematics remedi- 
ation as the procedure undertaken in the study. If incidental learning 
is'coBjmon in physics, instruction it might indicate that specific mathe- 
matics deficiencies are not a serious problem. Alternatively the 
physics programmed materials may have helped students learn remedial 
mathematics and this i^'why all groups did well. 



Lack of mathematical abilities is cited as a hinderance to success 
in physics by both students and teachers. This study addressed itself 
to this problem. 

Although the stated* purpose was twofold, the experimental portion 
was limited by the ineffective diagnostic system for identification of 
those in need of remediation. One is left with the impression that 
the results of remediation of requisite mathematics abilities on physics 
achievement were insignificant and as*a result, rather than concluding 
that the treatment was ineffective, an alternative explanation was 
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offered. It was suggested that the method of diagnosis o| need for ' 
remediation may not have. been effective and that a more ciireful analy- 
sis ill the method of identification was in order. Since the diagnosis 
for remediation was of major importance to the study, it would seem 
that this should have been done and evaluated before the study was 
^•begun., not ex post facto. 

The title of the study and the eajrly discussion of hierarchies 
implies a. greater dependence on Gagni type hierarchical structure 
than is .warranted . The author was unable to develop agreed ed upon 
hierarchies of requisite abilities. Instead, a number of mathematical 
abilities, which may orfmay not have been dependent on each other, 
were identified for eafih frame. Albeit the author did acknowledge 
this problem, indicatuig . . each 'hierarchy' was judged as an 
unordered set o^ requisite abilities for the problem in question." 
The inclusion of hierarfchical structure could have been omitted and. 
Instead, the problem approached in terms of ^ number of separate 
mathematical abilities. . ^ 

' T 

It was concluded that the programmed instructional format was 
more effective than the prior instructional format because of signifi- 
cantly higher scores on the PSSC Tests III and IV. Both the experi- 
mental and control groups used this mode* hence there was no control.^ 
Alternate hypotheses could be offered for the significantly higher 
scores on PSSC Tests III and IV. * ' 

Although significance tests were made between the experimental 
and control groups, most of the data was presented in terms of either 
the initial classes or the four subgroups. It would have been helpful 
and perhaps more meaningful to have the datp also presented in terms 
of experimental and control groups. Iii addition, initial differences 
were lndj.cated between the accelerated and college preparatory classes, 
but the article does not indicate if there were any^ initial differ- 
ences between t^he experimental and control groups. Even though random 
assignment of the students iiv'toth classes to experimental and control 
groups was made, significant differences between the two groups could 
have existed and should have been tested or reported if a comparison 
was made. „ 

The raw scores on the PSSC tests of the two classes and also for 
the regional student data used for the standardization of the scores 
would have been helpful. It is somewhat surprising that the accel- ^ 
erated and college preparatory classes both scored below the regional 
norms. The article does not give any means to verify the results pre- 
sented. 

/ , 
PSSC Test IV used to measure physics achievement may not have been 
a valid measure of thfe effect of remedial mathematiqs instruction. No 
analysis was made or indicated to determine if the mathematical abil- 
ities remediated were necessary for successful completion of the 
questions on the test. Perhaps all , abilities remediate4 were not needed 
in the PSSC Test JV and hence would 'not affect physics achievement. 

The remediation sequence included an entry problem requiring the 
mathematical ability in question. If the student could do the entry . ^ 
problem he was directed to an exit probleia also reqxiiring the ability. 
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If he could correctly 'work the exit problem he did not work thr:Ough 
the sequence. The study indicated that 56 percent or 46 of thQ 82 
remediations wer^ judged aa misdiagnosed. This suggests that only 
36 remediation sequences were done fo^: parts 3 and 4.- This Is only ' 
a little more than 5 percent of the possible 672 remediations (42 
mathematical abilities x 16 students in the experimental group) for 
parts. 3 and 4. It could be that all 42 n^thematics abilities were 
not fojutnd in parts 3 and 4; therefore the total number of possible 
remediations would be smaller. In, any case it seems as though so • 
few 'Students were initially lacking the mathematics abilities that 
the effect of remedial^ instruction had little potential to have an 
effect. u . 

^ Some results presented in the §tudy do -not seem possible from 
the^ data. In the "Kesults" section (pages 71-72) two of the mean 
differences given do not agree with the standardized data given> in 
Table II. The ^aver^ge gain between PSSC pretests and PSSC Test III 
is given as 6.*63 points with a gain of 4.00 points for the accelerated 
class and 8:65 points for the college preparatory class, but the data 
in fable II indicate an overall mean gain of 4.87 points with the 
accelerated class gaining 1.81 points and the college preparatory 

^ class gaining 7.21 points. Also in the '"Results" section, the dis- 
cussion indicates 1201 hierarchy frames contributed data, but Table. I 
indicates that the total number of hierarchy frames for both groups 
totals only 1021. The cards per individual for the accelerated, class 
reported in Table I also see^ in error. - It appears that the author 

"divided the mmbpr of remediation cards by all the students in the 
accelerated class, whereas only. 7 of the 13 students were in the 
experimejital group and hence particip'ated in the remedial frames* 

- :Seveto of the nine (no data) performance pattefns presented on 
• page' 74 .are theoretically possible, but, if they occurred with any ^ 
great ,frequency,'^it^ would suggest th^t the mathematics were not very 
evenly distributed "throi^hout 'the prdgram and the mathematl^p^ abil- ^ 
ities necessarj^ for part 1 were not hecessaYy for part 2 and vice^ 
versa. If this were true throughout the four units, it would not be 
surprising that remediation based on one unit would not be of help ^ ^ 
for the next. « » ^ 

On page 75 the number of students used for c^ifilation switches 
from 30 to 28 for no apparent reason an<i without ^Idnation. 

*• . . 

Calculations based on data in Table IV (p^ge 78) give chi-square 
contributions ot 37 and 86 respect iveiy for the "No Change" and 
"Inconsistent" categories and not 45.1 dnd 82.9 as repoi;ted. Sim- 
ilarly in T^ble VI calculations based on the data give a chi-square 
contribution of 323.7 for the "Consistent" category, rather than the 
347 indicate^d in the table. 

The 'discussion on page 76 considered an examination of the more 
evenly distributed mathematics abilities over parts 1 and 2. Twenty- 
eight of the original 52 abilities were removed from analysis, leaving 
24 - not 26 - as indicated in the discussion and in Tables V and VII. 
The use of 26* instead of 24 also causes an errdr in the number of 
performance patterns which should be 673 and not 728- as reported. 
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The use of 728 probably produces errors in the frequency presented ^ 
in Table V (page 76) . 

* « 

In addit^^ to errors, there were several confusing points in the 
article. Soma ai^he results are not clear from the data and dis- 
cussion presented, all of which contribute to the difficulty in read- 
ing and interpreting the article. Perhaps additional discussion 
would have clarified some of the confusing points and apparent errors 
in the . article. s/ 
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Rlechard, Donald E., "Life-Science Concept Development Among Beginning 
Kindergarten Children From Three Different Community Settings." 
Journal of Research in Science Teaching , Vol. 10, No. 1:39-50, 
1973. 

D,escriptors — Cu^ltural Factors, *Ear»l^f Childhood Education, 
*Educational Research, *KindergarterJ Children, Performance 
Factors, *Science Education, Social factors , *Socioeconomic' 
Background ■ - 4* 

. ■ 

Expanded Abstract and Analysis Prepared Especially for I.S;E. by 
David H. Ost , . Calif omia State College. 



Purpose 

^ The basic question investigated in this study was: Do children 
from three ^different community settings (inner-urban, outer-urban, 
and rural farm) vary significantly in the ability to conceptualize 
s^elected aspects of life science at the time they begin kindergarten? 
The study also addressed the parallel problem of identifying those 
factors which are useful predictors of scl^nc^ concept development 
among beginning kindergarten children. 



Rationale 

A major premise of the study was that "concepts" are the basis 
or structure for problem solving and of her higher order thinking pro- ^ 
cesses. If this is true then the assessment of , the ability of children 
to conceptualize science or the concepts with which children enter 
kindergarten is important to science curricula and instruction. (It 
Va not entirely clear which ofv these is really being investigated.) 
References are made to the increased attention being paid to early 
childhood education and the resulting associated research. This 
information is used to amplify the importance of knowing the relation- 
ship of a child's background to his/her conceptualizing ability. It 
is pointed out that little research has been done in this area par- 
ticularly as related to science education. 

Research Design and Procedure 

fhe investigation was of a descriptive nature; as such, no cause 
and effect relationships can be clearly defined. Six phases of the 
s^udy are reported. 

The first phase consisted of the development and pilot testing 
of the concept assesstnent instrument, Life-Science Concept Acquisition 
Test (L-SCAT) for children ages 4-6. The L-SCAT consists of 21 sets 
of colored pictures and an accompanying interview schedule. The 
pictures represent 35 concept items representing the seven BSCS con- 
tent themes. A panel of judges critiqued the L-SCAT. 




The instrument is an individually administered, picture-stimulus 
structured interview. The subject responds nonverbally and verbally 
about the pictures. Questions by the interviewer are based upon sub- 
ject responses. Scores are attained for the nonverbal portion, the 
verbal portion (the tape record is ^ scored in accordance with a criterion 
scale) » and on each conceptual scheme." The instrument is not published 
in the ireport. * 

The pilot test was made with two populations: kindergarteners 
of Caucasian blue collar workers (n = 18) and pre-k±ndergarten children 
enrolled in a Head Start program (n = 13) . No discussion is provided 
as to why these populations were selected. Item analysis, teacher 
ratings of children, as well as tesrt-retest procedures were utilized 
to determine the validity and reliability of the instrument. No 
reliability coefficient was given. % ' 

( • . ' 

The second phase of the reported investigation was the selection 
of the population. The three^ community settings were defined on the 
basis I of: geography; percent of families receiving Aid to Families of 
Dependent Children; occupation of residents; and, distribution of 
inhabitants. Additional socio-cultural data concerning the child's 
background and experience were taken from a Kindergarten Enrollment 
Information Form. 

Phase three was an attempt by the investigator to 'build a rapport 
with the subjects before collecting data from the subjects through the 
administration of various instruments. The administration of L-SCAT 
to 17 children from each community setting occurred in phase four. 
The subjects were selected at random by means of a table of random 
numbers. Phase five of the study was the administrating of the Cali- 
fornia Short-Form Test of Mental Maturity, Level 0, 1963 Revision. 
Phase six was the analysis of data. One way analysis of variance was 
employed to examine differences- among scores made on the L-SCAT by 
subjects from the three community settings. This was followed by the 
use of Tukey (a) procedure as a means of determining the source of 
significance. Step-wise multiple re/gression analysis provided infor- 
mation concerning the value of predictors of performance on the L-SCAT. 

Findings 

Beginning kindergarten children from the three community settings 
differed significantly in performance on all the L-^-SCAT measures (non- 
verbal, verbal and total). The order from highest to lowest being 
"outer-urban," "rural- farm," and "inner -urban." Analysis of the audio- 
tapes suggested that much of €he variance may have been in fact due to 
differences in concepts and concept formation unrelated to science. 
The investigator listed examples of such concepts as "ttnder," "before," 
"different," etc. 

The step-Wise regression analyses Indicate that I.Q. scores were 
the most useful predictor of L-SCAT performance. Chronological age 
was the second variable entered Into the equation. When only soclo- 
cultural variables, were used In the regression analyses for total score 
on the L-SCAT, the subject's mothers education was the first variable 
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entered (most useful predictor). Younger siblings, "extended family" 
and zoo visits were entered respectively with increased power of pre- 
diction. The ^only child" out-performed the child with siblings. The 
"extended family child" did not perform as well as t^e child who did 
not live in an extended family environment. 

Interpretations 

The investigator points to the ability of children to deal with 
the abstract c6ncepts of time, change and relationships as' important 
attributes, related to some science concepts such as evolution. This 
interpretation is ba^d, in part, on the resulta of subject's per- 
formance on the evolution theme (which is apparently onb^ of the BSCS* 
themes ^included in the L-SCAT) This supports the premise that science 
education for young children should not be drawn strictly from the 
domain of science; physical, mental and sociocultural factors must be 
given more than- superficial consideration when designing science educa- 
tion programs. 

Chronological age is cited as being a factor to which more atten- 
tion needs to be paid. The range of 7.8, months in the chronological 
age of the subject represents J.0 percent of the total time an. average 
kindergarten child has lived. This if act is particularly relevant, 
according to the investigator, in. that it is cited that 50- percent 
of the "intelligence" an individual has at age 17 is supposedly 
developed by the time the individual enters school 

Attention i? drawn in the report of the study to variations in 
L-SCAT performances of children as a function of **birth-order rank" 
and other .^€±|ted factors. The investigator raises the possibility 
of aiming earffy childhood science education at the mothers of young 
children. This notion is amplified further by the suggestion that 
since children already v§ry significantly in development prior to 
formal school it may be appropriate to affect "other people" in the 
child's environment who would be able to influence the development of 
science and other concepts. 

^ Abstractor's Notes 

. • ■ ' 

Aore questions are raised as a result of this study than are 
answer^; that seems to be the nature of a descriptive investigation.* 
Descriptive investigations may have greater latitude in the design, 
greater leeway in- the foci Qf study and produce results which are 
less elucidatory. However, the investigator has other constraints 
and responsibilities which* are not generally associated with 



* A descriptive investigation, as considered by this abstractor, is 
one which is more decision-oriented than conclusion-oriented; describes 
rather than explains; suggests the existence of relationships rather 
than demonstrating cause-effect; or, it may contribute to the solution 
of a specific or practical problem rather than explain the basis of 
the problem. 




• / - 

traditionally designed experimental investigations. To have its 
greatest use, a descriptive study must provide as much detail as 
feasible. In this study the investigator neglected to provi-de data 
concerning the reliability of the instrument; an instrument upon which 
the entire investigation was built. No rationale is provided for the 
populations utilized in theVilot testing of the L-SCAT. IThy was the 
instrument not f ie Id- tested ^i^l^jsdfefte population which subsequently 
scored the highest? ^ C . 

0 

The investigator provides no rationale for the use of the F-test 
with a relatively small population. It appears that the choice was 
based on the poWer of the test. In a similar manner, the establish- 
ment of the Sljpfia level (.05) appears to have been made to increase 
the power of the test. For what reason? If calculated ratios are 
being utilized as descriptors, then the level of significance or the 
power of the test is not all that important (except for the reviewers 
of articles submitted for publication) • The^ importance of signifi- 
cant differences in descriptive studies of the type reported here is 
of questionable value. It would have perhaps been more prudent and 
prbper to discuss at greater length the Type 1 and Type 2 errors 
associated vrf.th the study. The use of the F-ratio in conjunction 
wltl^ such discussions is of greater meaning in descriptive an^ysis 
than fitlmply knowing whether "it is statistically significant at the 
.05 level.", 

» ^ ;\ ■ 

Similarly, it would seem that a discussion of the rationale behind 
the choice of tlie Tukey (a) procedure over the Scheffe method would be 
of value. The latter is considerably more conservative with respect 
to Type 1 errors than is the Tukey (a) method. Such information is 
important if the investigator elects to report his/her investigation 
in^a descriptive fashion, let alone design it as such. 

Step-wise regression analysis is a very useful tool in descrlfetive 
studies. Again, the value of such a statistical tool may not lie as 
much in the statistical significance of entered variables as In the 
or4er of entry. The list of independent variables compiled Iq the 
study is lacking. Hence, much descriptive information is .lost. It 
would be of value to know which factors were found to be of little or 
no use in predicting L-SCAT scores. It would also be helpful to know 
how t;Jie^independent variables were selected. (Given a list of 50 
random variables, chances are that several will "he "statistically 
significant" at the .05 level.) Of course, those researchers inter- 
ested only in significant differences using different statistical tests 
are not impressed with such information. 

Although. the investigator inSicates, that the subject receives a 
score on each conceptual scheme found in the L-SCAT^ nowhere in the 
report is there a discussion of this item. (No significant differ- 
ences?) To the researchar in science education these data would seem 
to be as useful as, if not more important than, the verbal and non- 
verbal scores . 

A commesit in passing has to do with the significant figures used 
in this study and the majority of educational research. It is difficult 
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to ascertain the reason behind calculating a mean score (with an n 
= 17) to four decimal places* The science concept of significant 
figures seems to be undeveloped among education researchers. 

in summary, the study may be timely although the report is 
deficient in some aspects. With the increase in att-ention being paid 
to research in early childhood education the results of studies such 
as these may ^ay large -dividends . Multiple discipline studies may 
pr6vide much greater insight into the areais of early childhood science 
education. It is hoped that investigators who design studies which 
are intended to be *evaluatory or descriptive report their results in 
a manner congruent witli that philosophy. No doubt a full report of 
the investigator would answer many of what some would consider mechan- 
ical or diminutive questions raised* in this abstract. Unfortunately 
the reader has nothing but tljat article from the Journal of Research 
in Science Teaching which provides: less than adequate information 
about the sociocultural , physical, and mental variables used in- the 
study; questionable background and general Information concerning the 
major instrument in the study (L-SCAT) ; little or no information 
specific to science education or instructj|on; and, , little rationale' 
for ftuch of the design of the investigatibpi. ^ 
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Siegel, Betsy Davidson and Ronald Raven, "The Effects of Manipulation, 
on .the Acquisition of the Compensatory Concepts of Speed, Force,' 
.and Work." Journal of Research in Science Teac hing, Vol. 8, No. 

4; 373-378, 1971;^ ~; 

Descriptors— *Cognitive Development, Concept Formation, *Ele- 
mentary School Students, Force, *Instruction, *Leaming Theories, 
Psychology^ 

Expanded Abstract and Analysis Prepated Especially for I.S.E. by 
Mary Budd Rowe, University of Florida. 



Purpose 

Do students who actually manipulate materials (variables) learn 
concepts of speed, force and work as well or better than do those 
who see the concepts demonstrated? , 



Rationale ^ 

♦ 

Most modern science programs stress the value to the student of * 
actually performing activities and conducting investigations of 
various magnitudes. This activity-based approach to science instruc- 
tion is meant to help students acquire concepts, learn to control and 
manipulate variables, and to develop some sense of the tentative 
nature of knowledge 

Huttenlocher (1), however, showed that students who watched the 
experimenter do the manipulations rather than performing manipula- 
tions themselves, performed better on problem solving tasks than did 
the students who actually worked ^ith the materials. She suggested 
that manipulation by the students may have interfered with the proc- 
esses of remembering and interpreting information. 

Siegel and Raven challe^ed'Huttenlpcher's findings which were 
at the time already nine years old. They argued tl?at research by 
Inhelder and Piaget' showed that children' in the middle stage of con- 
crete operations can solve problems which involve compensating vari- 
ables by manipulating one variable at a time and comparing differences 
in the oxher variable. Nine to ten year olds must be p^fesented, they 
reasoned, with concrete objects. If only verbal data or viewing in a 
demonstration is all/6wed, children cartnot set up the problem in such 
a way as to make what happens meaningful. i 

Research Design and Procedure 

To find out under what conditions fourth graders were more likely' 
to be able to cbordinate two Vr more variables that reciprocally opppae 
each other, Siegel and Raven assigned each of 120 fourth graders to 
one of three treatment groups. The manipulation ^and demonstration 
groups received nine hours each of instruction. Each child in these 
tw^ categories received a workbook, prepared by the investigators. 
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Thio workbook contained schematics for each experiment and questions 
to be answered. Each child , in the manipulation condition had his own^ 
set of equipment. Children in the demonstration gtoup watched experi- 
ments being done. At the conclusion of instruction each child was 
tested individually . Apparently there was no placeba for the control 
^roup. 

Three contrasts were made. First, 20 students were selected at 
random from each group to determine whether the three groups ^ere in 
fact equivalent on the outcome measure CKruskal-Wallis test). They 
found that at least one linear contrast was different from zero so 
the next problem w3s to find out where \j:he difference was. A pro- 
cedure described by Marascuillo and McSweeney showed that the signifi- 
cant difference was between the control group and the treatment grpups. 
But, in the third step, the linear coixtrast between the treatment group 
means was not significant, i.e., the demonstration and experiment 
treatments were equally effective. 

The possibility still had to be considered^Kiat One or more of 
the concepts, speed, torce, or work wi^s betteilflkQuired under one 
treatment condition than another. The Friedman^wo-way analysis of 
variance performed on the remaining 20 students in each grot^p yielded 
no differences among the task categories. 



Findings 

There is no difference between the manipulation and demonstration 
treatment conditions on the outcome measure. Students learn equally 
well concepts of speed, force, and work under either training condition 
The trained students do better than the untrai^ied students . 



Interpretations 

Siegel and Raven maintain that their work shows that training can 
stabilize a compensation scheme in fourth grade children. 



Abstractor's Notes 

The financial implications of this study are enoxinous. The 
investigators show that training by demonstration is as* effective as 
training by active l^^boratory invblvement , at least so far as acquisi- 
tion of compensation schemes goes. It would seem that Huttenlocher ^s 
forecast had merit — although oddly enough — Siegel and Raven never 
return to a discussion of their findings in the context of Hutten- 
l^her'ff work which provided the Impetus for the study. Does manipu- 
lation InterfcrB witn the tasks of i^emembering and interpreting, as 
Huttenlocher. suggested, or are the opportunities for rehearsal of 
ideas less, likely to accompany a laboratory activity? Dewey often 
commented t^iat activity Tolthout reflection is a waste of time. 

It may be that a more refined question needs to be ksjced before 
we 'Stop buying equipment in class lots and begin returning to a 
demonstration mode. Perhaps some students do well under one kind of 



instruction, e»g., manipulation and .others do well under another, e.g., 
demonstration. A post hoc analysis^ would quickly show whether such 
were the case. In short there could be an aptitude-treatment inter- 
. .. action. We might surmise, for ^ample^, that the higher ability segment 
would do better under the demo^tration condition while the lower 
ability segment would thrive uider the manipulation condition. This 
kind of result would fall in line with recent studies in elementary 
science which show that low SES groups appear to benefit from labora- 
tory type programs in the primary grades while middle and upper SES 
groups do not (2) . 

The control group received nothing but the outcome test, not even 
training in the vocabulary which was central to the outcome measure. 
Thus, on the one hand, the finding that the instructed groups do better 
is in one respec| trivial. On the other hand, the fact that students 
can learn a cOTipensatory scheme through spec^ific instruction is not 
trivial. In so far as an evaluation of the' Huttenlocher proposition 
goes we have/ in this study neither inf irming nor confirming data since 
^ the two tr^tment groups do not differ significantly. However, the 
mean scor0 for the demonstration group, while not significant, is 
higher on each of the three concepts, and one wonders what pattern 
would emerge with a larger sample. If Siegel and Raven still have 
their 'original data, it would be nice to get some preliminary idea as 
to how tenable the aptitude-treatment interaction hypotheses is. 
Macbeth, for example, showed that kindergarten children who manipulate 
science materials attain science process skills better than those who 
do not have this opportunity, ^t the third grade, however, the manipu- 
lation group did no. better than the non-manipulation group (3). 

The outcome measure appears to be closely related to the training 
content. So we do not know whether a generalized compensatory scheme 
was learned or whether compensation of three variables, once acquired, * 
facilitates acquisition in another context. Neither do we know whether 
retention for the two experimental conditions is equally good. It was 
not necessarily the job of the investigators to answer these questions 
— but someone should, sometime. 
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Summerlin, Lee and Marjorie Gardner, "A Study of Tutorial-Typo Coiaputer 
Assisted Instruction in High School Chemistry/' JouiTnal of 
Research in Science Teaching , Vol. 10, No. 1:75-82, 1973. 
Descriptors — *Chemistry, *College Science, *Ccnnputor Assisted 
Instruction, Educational Research, '^Instruction, Instructional 
Media, *Science Education, Secondary School Science, Teaching 
Procedures 

Expanded Abstract and Analysis Prepared Especially for I.S.E. by ^ 
J» Dudley Herron, Purdue University. 



This research compares the effect ivenesa^f a tutorial (as con- 
trasted with drill-and-practice) CAI program in high school chemistry 
with conventional Instruction. 



•Rationale 

Although a number of CAI programs have been developed to provide 
drill or practice over materials taught in other ways, there are few 
programs for chemistry at the high school level that present new con- 
cepts via CAI. This research was an attempt to develop and test such 
materials. 



Research Design and Procedure 

A group of 110 students enrolled in chemistry at the University 
School at Florida State University were randomly assigned to treatment 
lind control groups. The treatment giroup (N ° 58) was given instruction 
on (1) structure of the atom (quantum mechanical model) , and (2) chemi- 
cal bonding (ionic, covalent, van der Walls, hydrogen) and molecular 
architecture via CAI at the CAI Center at Florida State University. 
The control group ([W = 52) received instruction over the same topics 
in their regular chemistry class. During the time of the experiment, 
studdhts in the treatment group reported to the CAI Center during 
their regularly scheduled chemistry class and had no contact with their 
chemistry instructor. Students in the treatment group were allowed to 
proceed at their own pace and had access to the CAI terminals after 
school hours and during study periods as well as during their regular 
class. Kecords were kept on the student responses entered, latency 
(time required by the student to respond to the computer) , total number , 
of correct answers, total time spent at the terminal, ^d any other 
response the student type J into t>ie» terminal after he was signed on. 
There wpre no absences in the CAI group during the study. All CAI 
students completed the entire program. 



Students in the cobtrol group were taught the same* material 
covered in the CAI program. This material covered four chapters in 
the student's text (Chemistry by Choppin and Jaf fe) . The control group 
spent 15 class periods covering the material included in the experiment* 



Purpose 
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(RGViC2WQr'9 Note: ThorG io ooce aiabiguity in the author 'o report of 
the tice involved in tho eKpericant. At the bottoa of p, 78 refer- 
ence io nade to "the two week period of the otudy" but on p, 79 it io 
otated that "all control group otudento opent three weeko covering the 
oacQ caterial included in the CAI program," Thio difference may 
olnply reflect the difference in time required by the two groups to 
ccaplete the material ao indicated below.) 

Each otudent in the troatnent group took a 60-item, multiple- 
• choice teot imediately upon completion of the experimental EultGrial. 
Studento in the control group took the ooms teot ao a group in tho two 
dayo following the completion of the inotructional material • A 35- 
iten teot wao adminioteted to both treatment and control groups 60 
dayo after completion of the experiia^nt. Both tests were constructed 
by the author by selecting teot items from the ACS-NST Cooperative 
Examination in High School Chemistry (1952, 1963, 1965, 1967, 1969), 
the Anderoon-Fiok Chemistry Teot (1966), and the CHEII Study Achieve- 
ment Teot (1963-64, 1964-65). The Kuder-Richardson 20 reliability 
eotimateo for the 60-item teot were .87 and ,85 for the treatment and 
control groupo respectively. The correoponding Kuder-IlichardsdlFi.20 
estlmateo for the 35-item test were'. 69 and .67, 

Teoto of the hypotheoes of no differences in mean score on the 
above tests were made by the t-test. 

\ 

Findings 

The difference in mean for the t-reatment group (X ° 20.66; S.D, ^ 
6.49) and the control group (X ° 24,19; S,D. ° 9.36) on the 60-item 
test administered at the close of the e:cperiment x^as found to sig- 
nificant'at the 0,5 level. The '35-item delayed posttest showed^ 
differences in the same_jdirection and were significant at the smne 
level of confidence, (X Treatment Group = 10.66; S.D. ° 4.64; X 
Control Group = 12.62; S.D. 4.64) However, the CAI students com- 
pleted the instructional material in an average time of 275 minutes 
compared to 750 minutes spent in regular class instruction. It was 
also found that students in the CAI group had a favorable attitude 
toward their work. (This result was previously reported and is only 
mentioned in this article,) 



Interpretations 

"The data collected in this study Indicate that students learn 
moro, as measured by posttests, with typical classroom instruction 
in chemistry than they do with tutorial-type computer assisted 
instruction." ..."Data collected in this study indicate that students 
can complete the same amoyint of material via CAI in Iqss than one- 
half the time required by^ students in the typical classroom.'* ... 
"strong student interest and favorable attitude tox^ard CAI imply that 
this mode of instruction can be used effectively. Further, it is 
suggested that the difference between the mean scores on the post- 
tests of the two groups in this study, ^. is small. When this is 
compared with the positive student interest, attitude, and 
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tlDC-cconoay , the pooitivg aspecto of CAI outweigh the negative acpects 
(p. 81 of article under review.) 



0 Abstractor 'o Noteo 

Thio otudy of CAI in chemistry inotructibn conatitutes a worth- 
while addition to our understanding of CAI. It would appear that the 
otudy wao carefully conducted and this reviewer sees no reason to 
question i^o internal validity of the study. ^The reviewer Jalso agrees 
with the aiXthor that the differences in mean achievement by*the treat- 
ment and control groups, though statistically significant, are sme^ll 
enough that one should not argue strongly for one mode of instruction 
in preference to another. As indicated by the standard deviations in 
the scores, variance within groups was considerably more than the 
variance between groups. Factors such as cost (not mentioned in the 
study), tine (bp^h developmental and instructional), and attitude 
(both long range and short range) must be considered along with the 
data on mean achievement in deciding *f or or against tutorial-type 
computer assisted instruction in chemistry. 

Considerable care must be exercised in generalizing the results 
of this study to other instructional situations. No description of 
the students involved in the study is g^ven. If the majority of the 
students involved in the study are bright, highly motivated students 
(or dully poorly notivated students), the results may differ con- 
siderably from tho^e that would be obtained with a different student 
population. It should also be noted that the study compares the 
effectiveness of a particular CAI program with the effectiveness of 
classroom instruction of a particular teacher. The lower. achievement 
by the CAI group might be improved by an improved CAI -program. The 
longer time required by the control group might be shortened through 
more effective classroom instruction. The study provides little 
information on the multitude of variables inherent in each instruc- 
tional mode which interact to affect both achievement and instructional 
time. Readers who are seriously interested in CAI will certainly want 
to communicate with the author to obtain a copy of the Ci^I material 
for careful examination. 

Perhaps the most serious question to be asked by the reader is, 
"How do I use the results of this research?" 

Can the reader say, "Achievement will be greater under regular 
classroom instruction. Therefore, I should forget about CAI?" 
Hardly. Although this may be true, modifications in the CAI program 
might make it more effective or modifi-cations (such as changing the 
teacher) in the . classroom instruction' might make it less effective, 
thus reversing the results found in this study. Repetition of the 
result found in this study with a different CAI prognsm, a different 
classroom teacher, and a different group of students is problematic. 

Can the reader say, "CAI instruction can be expected to take 
less instructional time than regular classroom instruction?" Not 
really. It is likely that the CAI instruction took less time because 
it was self-paced and the program was written rather efficiently. 
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But it io poaoible to make regular claaoroom instpaction oelf-paced 
as well and it is possible to write CAI progi^^is so inefficient that 
it takes a very long time for otudlento to coi^tece^hem. Once again, 
it is the unknown particulars of each mode of instruction whfch governs 
the time required for completion of the instruction. 



Iiivthe view of • this reviewer > comparative research of this kind 
is of ii6st value to the community ofo science educators when it is 
able to tell us why the observed differences in performance and time 
were found, rather than that they exist. For example, based on the \. 
reviewer's own work, it is often found that differences in means If 
such as those found in this study can be attributed t'o higher per^ 
formancc of one group on a very few items in the criterion test. (That 
is, item analysis* of the test results often shows that the proportion 
of students answering each item correctly will be approximately the 
same for a majority of the items but may differ substantially for 
three or four of the test items.*).. It is often possible to trace this 
difference in performance to unintentional gaps in one or the other 
instructional strategy. Subsequent revision of instruction may then 
result in elimination of the observed differences^ It is the identi- 
fication of such weaknesses yin instruction that will have the greatest 
impact on the improvem^t of ' instruction. 

In similar fashion, we need to know why students were able to 
complete the CAI materials in less than half the time required in 
normal class instruction. Knowing this, it might be possible to 
effect the same economies using strategies that can be applied in more 
conventional classroom settings. If this ±^ possible, the potential 
for instructional improvement is greatly enhanced. 

These comments are in no way intended to demean the work reviewed 
here; rather, they are simply intend^sHfo^suggest the kind of infor- 
mation that is needed in order to make evaluative studies most useful 
to the science education community. 
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Tamir, P. and F. Glassman, "A Practical Examination for BSCS Students: 
A Progress Report . " ^ Journal of Research in Scie nce Teaching, 
Vol. 8, No. 4:307-315, 1971. 

Descriptors— *Achievement, Biology, ^Laboratory Procedures, 
*Problem Solving, Research Design, Secondary School Science, 
*Skills, *Test8 

E^anded Abstract and Analysis Prepared Especially for I.S.E. by 
/Thomas P. Evana, Oregon State University. 



Purpose 

The purpose of this investigation was fqurfold in^i^ature. It 
included the following problem areas: 

a. a comparison of results obtained by administering a 
practical examination in 1969 and 1970 to Israeli 
twelfth grade BSCS biology students; 

b. the development and use of a scoring grid to measure 
specific skills regardless of ^the fact that different 
problems were being administered to different students; 

c. an investigation of the possible relationships that 
might exist among the identified skills; and 

d. a determination of the practical examination's effective- 
ness in discriminating between BSCS and non-BSCS 
biology . students . 



Rationale „ 
•J 

The invest:}.gation was a follow-up and extension of a previous^ 
report ^by the same investigators (1) . It centered around the notion 
that pencil and^ paper evaluations do not measure some important^ 
student objectives that should be an integral part of a BSCS laUora- 
tory-centered approach to the teaching of biology. A further con- 
sideration was 'the fact that although practical examinations have 
been frequently stated as being a desirable component of an evalua- 
tion program for laboratory activities, they have seldom been used 
as evaluation tools by researchers in science education. 



Research Design and Procedure ^ 

Two practical laboratory examinations for use with secondary 
school biology students were developed by the researchers. Each 
examination consisted of 15 percent plant identification with a key, 
-35 percent oral examination on plants and animals, and 50 percent 
problem solving through experimentation. One examination was admin- 
istered in 1969 to 99 Israeli twelfth grade BSCS biology students "as 
a part of their biology matriculation examination. The second exami- 
nation was administered in 1970 to a similar group of 147 BSCS students. 
All students had received an average of four periods of biology 
instruction per week over a period of four years in grades nine, t^n. 
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eleven, and twelve. The data from the examinations were analyzed 
through the use af correlation and a comparison of mean scores. 

Since the problems comprising the examinations were not identical 
Qx0i different students were assigned different problems, a scoring 
grid was devised to improve the assessment of specific student skills. 
Two of the skills, manipulation and self-reliance, were assessed 
through observation during the examination period. The remaining 
skills (observation, investigation, communication, and reasoning) 
were assessed from atudents' written answers. Analyses were made of 
inter correlations of mean scores in the various skills. 

Sixty Israeli twelfth grade biology students who had studied 
conventional biology for four years and who closely matched their 
BSCS counterparts were selected as a comparison group. The mean 
school grades in biology from the BSCS and non-BSCS students were 
75.41 and 75.00, respectively. Two of the three teachers whwtaugKt 
conventional biology had received BSCS training and were teaching 
BSCS biology to other students . The third biology teacher had not 
received BSCS training but had engaged in research at the university 
level. Three problems from the 1969 examination and three problems 
from the 1970 examination were administered to the comparison group 
during their matriculation examination. An analysis of covariance, 
correlation and mean differences in performance were used to compare 
the scores of the 60 conventional biology students with the scores 
of their 142 BSCS counterparts . 



Findings 

The findings reported by the investigators were as follows: 

a. the mean scores of the BSCS students on the 1969 and 
1970 examinations were 71.76 and 73.90, respectively; 

b. intercorrelatlons between the mean scores among the. 
components of the practical examination revealed "^that 
plant identification with a key correlated 0.19 in 1969 
and 0.17 in 1970 with problem solving through experi- 
mentation; performance in oral examination on plants 
and animals correlated 0.26 in 1969 and 0.35 in 1970 
with plant identification with a key; performance in 
oral examination on plants and animals also correlated 
0.44 in 1969 and 0.37 in 1970 with problem solving 
through experimentation; all these correlations were 
significant at the 0.05 level; 

c. achievement on the practical examination showed a 
correlation coefficient of 0.23 in 1969 and 0.39 in 
1970 with student scores on a pencil and paper ach4.eve- 
ment test in biology and a correlation coefficient of 
0.25 in 1969 and 0.33 in 1970 with student grades in 
biology; these correlations were significant at the 
0.01 level; 

d. a scoring key was developed having manipulation (10), 
self-reliance (10), observation (15), investigation (20), 
communication (15), and reasoning (30), as the identified 
skills ; the numbers in the parentheses represented the 
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weighted values Of each skill in percent of the ^otal 
score; 

e. inter correlations of mean scores among the various skills 
showed investigation correlated 0.01 with manipulation 
and 0.18 with self-reliance; investigation showed a 
significant correlation coefficient at the 0.01 level of 

' 0.33 with observation, 0.41 with communication, and 0.56 
with reasoning; reasoning also correlated significantly 
at the 0.01 level with manipulation (0.40), self-reliance 
^ (0.37)., observation (0.45), and total examination score 
(0.79); and 

f . the BSCS students scored significantly higher at the 0.01 
and/or 0.05 levels than the conventional biology students 

* in self-reliance, reasoning, ^d total examination score. 

Interpre tations 

The following statements summarize the conclusions reported by 
the researchers 2 

a. practical examinations measure some aspects of achieve- 
ment that are not measured by teacher grades or pencil 
and paper tests ; 

"b. students with poor or low manipulative skills are not 
necessarily low in their investigative skills; 

c. a practical 'examination has considerable Impact on 
instruction and may. be used as an agent for instructional 

* changes in a predetermined direction; 

d. BSCS biology students have an advantage over non-BSCS 
biology students in solving problems through the use of ^ 
experiments in the laboratory; and 

e. the BSCS biology course has a higher quality of concept 
learning associated with it than does the conventional 
biology course. 

Abstractor's Notes 

Science educators have often stated the desirability of In- 
cluding practi<ial examinations into comprehensive programs of evalu- 
ation, but the statement is seldom put into practice in the science 
classroom. This research represents an exception. It provides an 
example of the development and implementation of a practical exami- 
nation in biology at the secondary school level. The research further 
illustrates that the practical ex,?mination measured certain aspects 
of student achievement in biology that were not measured by pencil 
'and pape-i^ tests or by teacher grades. 

The research report is useful » but it would have been more use- 
ful to other persons wishing to develop and implement their own 
practical examinations if it had included a discussion of the ration- 
ale and methods used in developing the scoring grid an4 the procedures, 
used in achieving interrater reliability. Hoftswere the skills selected? 
Why did some of the skills receive higher weighted values than others? 
How were the skills defined? Was it difficult to discriminate among 
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the skills? How many practice sessions were necessary before a high 
interrater reliability was achieved? After a period of training] 
could independent raters obtain a relatively high interrater relia- 
bility? A consideration of questions such as these would have been 
useful because the answers are crucial for determining the objec- 
tivity of any. practical examination. 

The researchers came to the conclusion that the practical exami- 
nations they developed had an impact on BSCS biology instruction 
from one year to the next. Similar conciusions have been reached 
by other researchers. However, the conclusion raises some interest- 
iiig areas in need of further research. To what extent can science 
instriiction be altered through the use of practical examinations? 
Can the instruction be changed in predetermined directions? Are the 
changes more ov less permanent until an examination stressing new 
student skills is' introduced? Bid the instruction in fact change, 
or-did the students lobk at last year's examination and modify their 
own behavior independently of the instruction they received in class? 
Remember the examination was given as part of the matriculation ex- 
amination in biology. Students are generally very sensitive to the 
types of questions asked on previous matriculation examinations. 
Systematic observation of classroom behavior might provide some evi- 
dence regarding changes in instruction; It inight also be interest- 

,ing to administer a similar examination the following year to non- 
BSCS students who had been taught by teachers who had not partici- 
pated in the research nor had given the practical examination the year 
before as part of the matriculation examination. These questions 
are not to say that classroom instruction was or was not changed. 

.They are suggesting the need for further evidence before the conclu- 
sion can be fully substantiated. 

A shift in the correlation coefficients that were reported 
between students ' total exaziination scores and school grades in 
biology illustrates another area in need of further investigation. 
The total examination scores of the 1969 and 1970 BSCS biology stu- 
dents showed significant correlations at the 0.05 level or 0.25 and 
0.33, respectively, ^th school grades in biology. In 1970 when 
the practical examination was used in an attempt to discriminate 
between BSCS and non-BSCS biology students, a correlation coef- 
ficient of 0.59, significant at the 0.01 level, was reported between 
mean performance on the total examlnatlLon and mean school grade in 
biology for the BSCS students. For the non-BSCS students the cor- 
relation between mean performance on the total examination and mean 
school grade in biology was 0.28, which was significant at the 0.05 
leveH The researchers mentioned the difference in degree of cor- 
relation when discussing the BSCS and non-BSCS students, but they did 
not offer an explanation for the shift from the earlier correlation 
coefficients. Ti- may have been that the 1970 comparison group of 
BSCS teachers altered their grading criteria as a result of having 
become acquainted with the practical examination in 1969. The use 
of the scoring grid in 1969 but not in 1970 may have caused the shift 
in correlation. Another possibility is that yearly school grades in 
biology for BSCS students were quite different from mean school grades 
in biology for BSCS students. Hopefully, this is not the case as 
school grades i'n biology were used as the covariate in analyzing the 
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differences in performance between the BSCS and non-BSCS students on 
the practical examinatiion. 



The use of matching and /or analysis of covariance can result in 
fairly comparable groups if relevant predictor*^ variables, i.e. pre- 
measures which have an appreciable linear association with the post 
measures, are available. But even when such variables are available, 
the use of matching never assures the researcher that the differences 
between groups on post-measures are exclusively the result of an 
experimental variable. This, then, is the case in the present study. 
In fact, one might even question some of the predictor variables 
which were used. For example,. .matching teachers on the basis of hav- 
ing received BSCS training or engaging in research at the university 
.level does not ensure that the teachers will have similar teaching 
methods. Before such an assumption could be acceptable, it would be 
necessary to systematically observe the teachers while they were in 
the act of teaching. Another example of a questionable variable for 
matching, as w^ll as being a questionable covariate, was school grades 
in biology. They were linearly associated with total student per- 
formance on the practical examination, but the association was much 
greater for the BSCS group than the qpmparison non-BSCS group. In 
conclusion, it should be pointed out that matching and/or the use of 
analysis of covariance to obtain equivalent groups should be used only 
when it is impossible to*set up a true experiment. The researcher 
should acknowledge his or her awareness pf the fact that the use of 
matching and /or analysis of covarianceNto obtain equivalent groups 
does not provide a rigorous basis for as^ssing experimental error I 



1. Tamir, P. and Classman, P. "A Practical Examination for BSCS 
Students." Journal of ResearcK in Science Teaching , 
Vol. 7, No. 2:107-112, 1970. 
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Holman, Klchard, "Student Performance in Lower Division Collegiate 
General Biology Programs in Selected Community Colleges and 
Four-Year Institutions in Oregon." Joum^^l of Research in 
Science Teaching , Vol. 8, No. 2:105-112, 1971. 
Descriptors — Academic Achievement , Biology, *College Science, 
*Comnjunity Colleges, *Critical Thinking, Research, *Student 
Characteristics, *Universities 

Expanded Abstract and Analysis Prepared Especially for I.S.E. by 
Robert E. • Yager, University of Iowa. 



Purpose 

The experiment was designed to study the following questions: 

(1) Is there a significant difference in student performance 
between four-year institutions or among community colleges 
in terms of sub-scores on a test of the principles of 
biology measurin^g the behavioral levels of knowledge, 
comprehension, application, and. a combination of analysis, 
synthesis, and evaluation? 

(2) Is there a significant difference in student performance 
between four-year institutions and community colleges in 
terms of critical thinking ability or in terms of sub- 
scores on a test of the principles of biology measuring 
the behavioral levels of knowledge, comprehension, appli- 
cation and a combination of analysis, synthesis, and 
evaluation? 



Rationale . 

This investigation insulted from a 1967 concern in Oregon for the 
equivalency of educational experiences in science courses between the 
university system and the community college system. Notation was made 
of studies concerned with measuring success of community college 
students compared to native upper division university students. Nota- 
tion was also made of a number of studies concerned with measuring the 
differences in attitudes, socio-economic level, and ability between 
students attending community colleges and those attending four-year 
colleges or universities. 



A study conducted in New York by Kochersberger was cited as a 
major study concerned With reporting on the success of community 
college and, university students in similar science courses. Kochers- 
berger*s study completed in 1965 indicated that there were no signifi- 
cant differences between community college and iiniversity students as 
measured on a common test of principles of biology. He did report* 
that university students received more D & F grades while low-ability 
students performed better in the community, college atmosphere. 




A comparison study conducted at Oregon State University by Denney 
was referenced • y Thes e studies were designed to add more information 
concerning our knowledge of equivalency of similar courses (educational 
experiences) conducted in community colleges and four year universities 



in Oregon. 



Research Design and Procedure v 

A total of 261 students from three Oregon Community Colleges and 
"465 freshmen and sophomore students from four-year institutions were 
selected for use in the study. All students had completed all three 
quarters of a general biology course and all had high school grade 
averages available. Campbell and Stanley's post test only control- 
group model was Selected as the experimental design. -This design is 
designated: 

Rl,2,3, X O1O2O3O4O5 
R4,5 O1O2O3O4O5 

Vhere Rl,2,3 represent the students from three community colleges and 
R4* 5 represent students from two universities. The experimental 
v«feble, X, was Instruction in the biology course in community 
colleges* The students from the four-year institutions served as 
controls. O1O2O3O4O5 represent five test scores from the populations. 
High school grade average and sex were held as covariates . 

,Four "common" ol^Jectives for the biology programs were identified. 
The study focused \ip on two of these: 1) to be able to think critically 
and to evaluate f^cts and data, and 2) to gain an understanding of the 
fundamental faqjts and principles of Science. The Cornell Critical 
Thinking Test,,^3^orm Z, was chosen for^testing of the first objective. 
The author ohose 155 items from Testing and Evaluation in the Biological 
Sciences were concerned with content in biology common to the - 

Courses at "all five Institutions. A critique jury consisting of an 
instructor from each of the five inpti tut ions assisted in limiting the 
test to ^5 items. These items were subsequently divided into sub- 
areas representing knowledge, comprehension, application and a 
combination of analysis, synthesis, and evaluation. 

The two instruments were administered to all students (261 com- 
munity college, and 465 university) at their respective Institutions 
during the same week. Botft instruments were administered during the 
same period with a time limit of 60 minutes assigned to each instru- 
ment- All students completed each instrument in this time frame. 



Findings 



For the first question investigated, concerning differences 
between fqur-year institutions or among comipunity colleges, or^ both, 
the Gauss'-torkof f setup for multiple measurements (analysis of dis- 
persion) yielded the following F-^values for each test and/or sub-test: 
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Critical Thinking 



0.37 



Knowledge 6^28 

Comprehension ^ 3.36 

Analysis 10.58 

Combination of Analysis, 

Synthesis, & Evaluation 6,51 



The F-values for Knowledge, Analysis, and the Combination are signifi- 
cant at the .01 level while the value fqr comprehension is significant 
at the .05 level. , — 

The differences between the" highest and lowest means for community 
colleges were compared with the standard deviations of the regression 
coefficients to determine the location of significant differences. 
The difference among the community colleges was reported to 'be caused 
by a difference between institutions on the knowledge subtest and the 
difference, between institutions on the combined analysis, synthesis, 
and evaluation subtest. Similar differences between highest and lowest ' 
four-year Institutions means were also compared with the standard ^ 
deviations of the regression coefficients. Significant differences 
were found between the high and low .mean scores on all four subtests 
of the principles of biology test (knowledge, comprehension, applica- 
tion, and combined analysis, synthesis, and evaluation). 

For the second question investigated, concerning differences 
between community colleges and four-year institutions, the analysis 
of dispersion yielded the following F--values for each test and/or 
subtest: 

Test F-Valu^ 
Critical Thinking 
Knowledge 
Comprehension 

Analysis 0.01 

Combination of Analysis, 
Synthesis, & Evaluation. 0.16 

The .F-values for the critj.cal thinking test and for the knowledge 
subtest of the principles of biology subtests are significant at 
the .01, level. The F-values for the other subtests are not signifi- 
cant at the .05 or .01 levels. The students at four-year univer- 
sities attain significantly higher scores in critical thinking and 
knowledge o-f biology than do students from community colleges. 



9.35 
29.89 
1.11 



\ 
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Interpretations 

The results of this study indicated that: 

(1) the comniunity college general biology students attained a 
level of success on three of the four^ subtests of the test 
of the principlesT of biology equivalent to that of the 
four-year institution general biology student; 

(2) the conmxinity college student attained a higher level of 
success than the four-year institution students on the 
knowledge subtest of the test of the principles of biology; 

(3) there was a greater difference between the four-year 
institutions than between the community colleges and four- 
year institutions on the subscores of the test of the 
principle^ of biology, and 

(4) the ga^ral biology students attending the four-year inati- 
tutioifs attained higher scores than the community college 
general biology students on the Cornell Critical Thinking 
Test, Form Z . 

The author draws no inferences, suggests no implications, and 
does not relate the findings to the reports of other investigators. 
He does, however, mention some differences which were not mentioned^ 
nor discussed in connection with design. Part of the significant 
difference between four-year institutions appeared attributable to 
one additional hour per week spent in small group discussion with a 
biology instructor at one institution. The author also suggests^ that 
difference in critical thinking ability may be caused by the fact 
that students with more ability in critical thinking attend four-year . 
colleges • The author also suggests that the reason Chat community 
ctllege students were superior to students from four-year institutions 
may be that instructors in more community colleges are concerned about 
equivalency of courses and thereby overemphasize memorization of facts. 

Abstractor's Notes 

A question Is raised concerning the appropriateness of the title 
of the paper. Since' f our-ye^ colleges are held as a control group 
in the design and only the community colleges are being studied, a 
more precise title would' b^ desirable. In a slmila|: vein, a mor^lpre- 
cise description of relevant literatuigs would be helpful. Also, V 
relating back to relevant literature in the interpretation section 
would be most helpful • 

Several comments regarding external validity seem warranted. 
The section of the manuscript headed "discussion"' is confusing. 
Differences among assumptions, generalizations, explorations, and 
conclusions should be made. Possible explanations shoul4 certainly 
not be advanced as conclusions. Mo real "discussion" is included. 
The author should be more careful not to make generalizations beyond 
his sample. This is especially true in summarizing or stating 
specific conclusions resulting from the study. The statement in the 



discussion" ^section that university students' superiority to com- 
munity college students on the critical thinking test may be because 
university students are more capable of critical thinking is unsup-^ 
ported. High school grade average is covaried. Is the author sug- 
gesting that this was not an adequate covariate? 

Some comments concerning internal validity are also suggested. 
Reliability measures pf the Co rnell Critical Thinking Test are low 
for a standardized test. Should not this fact have been noted, dis- 
cussed, and considered? Reliability evidence for^ the biology test 
is desirable since it is specially constructed for this e3q)eriment. 
Were any data collected? The actual design used appears to be 
Campbell and Stanleys' static group comparisons. The posttest only 
control-group design, which is claimed to be used, is inappropriate 
for this analysis since it entails random assignment to the two 
"treatments" (i,e, two and four-year colleges). Motivation, selection 
mortality, and interactions are all potential sources of internal 
invalidity. Another serious question is related to the validity of 
the biology examination. No construct nor criterion-related validity 
is reported. 

Some comments related to data analysis are suggested. The cri- 
teria (i,e. cutoff level) for eliminating items in the construction 
of the biology test are not defined. This seems partiqiiiarly impor- 
tant in view of the problems of the Cornell Critical Thinking Test 
and the results reported nsVag the two instruments. The rationale 
for using the 6auss-Markoff setup for multiple measurement needs' 
further explanation, especially since reference provides no practical 
information on testing hypotheses. The calculation of the degrees^ of 
freedom used seems incorrect. An explanation of why the differences 
between means is compared with the standard deviations of regression 
coefficients is also needed. The study would be greatly improved if 
the number of hours each course met per week were covaried. This 
appears to explain much of the significant differences. Further, the 
author at one point offers this as an explanati9n of difference with- 
out even identifying it as a variable initially. An explanation of 
why sex is covaried should be included. 

^ With such a large sample, a measure of the degree of association 
(6)2) for the F tests would be helpful in determining practical sig- 
nificance of results. Upon inspection, such significance does not 
appear to be too gteat. None of the subfest means vary more than one 
point between two-year and four-year institutions. None ot the sub- 
test means vary more than two points across all schools. With the 
large number of subjects used practical significance may have been 
masked , 

Why was not a pre-test considered? A tighter design would have 
been possible. Assumptions concerning the initial characteristics 
of students enrolled at the two types of institutions would not be 
necessary. Obviously more extensive discussion and interpretation as 
well as more precise conclusions would be possible. The fact that 
the high school grade average for community college students were 
lower suggests problems. Initially the author seems to criticize 
other studies concerned with comparing success with general grade 
averages. Why did he choose high school grade-point averages (and 
sex!) as covariates? 
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Scizo cpecific quootiono and ccni::cnto follow: 

1) Tho notation R X 0 dcao not ref or otrictly to Canpbell and 
Stanley criteria and io *^QoiQo" rather than "oignal." Since 
both groupo received tooting, Xi and X2 would have been inoro 
appropriate. Since no r^dca aooigmnent of the otudento wao 
accOHpliphed, the author actually should have uood the Static- 
Group Conparison (Design 03) . 

2) Wao equivalence "proved?" (page 110 of paper) 

3) Doeo the author uoe between" and "among" in the conventional 
otatictical mannei^ (see page 105) ^ 

4) Tho details of item^nalyoio — dio crimination index (D) cmd 
difficulty (d) certainly need not be included in the body of 
the paper. A footnote reference would have provided tho 
reader with the information without looing the flow of the 
paper . 

* 5) Are Ta^le III and IV (pagea 108 and 109) needed for the 

general reader? They oeem of little value to the otudy andA>r 
to tho queotiono raised. Perhaps greater consistency with 
respect to inclusion regarding descriptions, design, and 
analysis would be an improvement . 

The limitations of the study should be more apparent for tho 
reader. Further, the author needs to show evidence that he is aware 
of these ^imitations. Certainly the generalizations he includes lead 
one to question the manuscript, the conclusions, and the contributions 
to the field. On the other hand, many aspects of the study are clear- 
cut and solid. Soma needed descriptions and explanations havo been 
omitted. Other material seems to have been included for little or no 
reason. Making the conclusions more precise and relating them to 
previous reports^ would be a desirable improvement. 



